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(57) Abstract 



The invention relates to a nucleotide sequence which comprises a sequence involved in the expression of the human Fatty Acid 
Transport Protein (hFATP) comprising the aminoacid sequence of the Figure. 
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Nucleotide sequence expressing human Fattv Acid Transpo rt Protein and 
corresponding aminoacid sequence. Use for the regulation o f fattv acids 
metabolism. 

The invention relates to nucleotide sequences involved in the expression 
of the human Fatty Acid Transport Protein (hFATP) and relates to the Fatty 
Acid Transport protein. 

Fatty acids, especially long-chain fatty acids, are the structural 
components of several classes of lipids;- they represent an important energy 
source, particularly with respect to glucose, for various tissues or organs. Fatty 
acids furthermore participate in several cellular signaling processes. 

Excessive intracellular accumulation of fatty acids due either to errors in 
metabolisms or to increased supply can have severe pathological 
consequences. Furthermore, the involvement of fatty acids in several diseases 
such as insulin resistance and coronary artery disease made the inventors 
focuss research efforts on the mechanisms which control the homeostasis of 
these lipid moieties. 

Interestingly, fatty acids are ligands and modulators of transcription 
factors, which control their own metabolism. This emphasizes the fact that the 
various metabolic pathways are not only controlled by substrate supply, but 
also by the participation of substrates and metabolic intermediates in regulatory 
phenomena. 

Intracellular Fatty Acid (FA) concentrations are in part determined by 
proteic regulators, in particular by regulating import/export system that is 
controlled by several proteins including Fatty Acid Transport Protein (FATP) 
and acyl-CoA synthetase (ACS). 

Fatty Acid Transport Protein (FATP) was first isolated in the mouse in 
1994 by a functional cloning approach (Schaffer et al, 1994). 

The inventors have now isolated and, characterized the' human FATP 
coding sequence and the gene comprising said coding sequencer They have 
furthermore identified the chromosomal location of the FATP gene, in a region 

CONFIRMATION'COPY 
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implicated in several metabolic diseases, suggesting that FATP is part of an 
important- group of synthenic genes. hFATP is expressed in several organs 
(heart, brain, liver) and tissues (adipose tissue, muscle) and is especially highly 
^ expressed in .insulin-sensitive tissues. Regulatory studies suggest that its 
expression is under the control of several transcription factors including PPAR 
(Peroxysome Proliferator Activated Receptor) (Martin et ai , 1997). Especially," 
Martin G. et al., 1997, have shown that the expression of Fatty Acid Transport 
Protein is regulated by PPARa and PPARy activators, such as fibrates or 
antidiabetic thiazolidinedione. The inventors have now determined that the 
expression of the hFATP gene is also under the control of the Retinoid X 
Receptor (RXR) known to transduce the effects of retinoic acid on gene 
expression and involved in the regulation of lipid and glucose metabolism. 

Therefore, FATP is an important target for therapeutic agents used in the 
treatment of several pathological states resulting from anomalies in regulation 
pathways of expression, especially of transcription or translation of the gene 
coding for FATP, either directly or through transcription factors. 

The invention thus, relates to genetic means and especially to nucleotide 
sequences involved in the regulation pathway of the long-chain fatty acid 
metabolism and therefore makes available new compounds appropriate for the 
definition of therapeutic means useful for treating pathological disorders related 
to the metabolism of long-chain fatty acids. 

An object of the invention is therefore a nucleotide sequence which 
comprises, a sequence involved in the expression, of the human Fatty Acid 
Transport Protein (hFATP) comprising the aminoacid sequence of Figure d or 
Figure 5. 

The nucleotide sequence of the invention can be any type of sequence 
including any DNA especially genomic DNA, synthetic DNA, RNA and 
especially mRNA, said sequences being sense or antisense sequences. 

Therefore, the invention relates to these sequences when they are 
independently of the conditions used to obtain them, thus including sequences 
either extracted from a biological sample, or cloned or synthesized including by 
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enzymatic or chemical processes. 

By the expression "a sequence involved in the expression", it is intended 
within the* present invention a sequence coding for the Fatty Acid Transport 
protein (FATP) .and especially human FATP, or a sequence involved in the 
regulation of the steps required to express the FATP gene, especially the 
hFATP gene, or both sequences operably linked. Said regulation sequences 
can be derived. or are derivable from the native genomic sequence of the FATP 
gene, especially hFA TP gene. 

Regarding ; sequence coding for the FATP, especially hFATP, the 
invention pertains to a nucleotide sequence which is capable, when placed 
under the control of appropriate regulation elements, especially regulation 
nucleotide sequences, prompter, enhancer, transcription sites, to be 
transcribed and under appropriate, conditions to be translated into an amino 
acid- sequence. Said aminoacid sequence can then possibly be processed, 
depending on the expression system chosen. 

Especially, the human Fatty Acid Transport Protein which is expressed is 
characterized by the amino acid sequence that it comprises. Depending from 
the cellular host chosen for its expression, the structure of the obtained hFATP 
can vary, especially as the result of a maturation process, or of environmental 
conditions. 

According to a specific embodiment of the invention, the nucleotide 
sequence comprises, a sequence encoding the human FATP, corresponding to 
or comprising the nucleotide sequence having nucleotides 1 to 2222 of one of 
the sequences of figure 1 , figure 2 or figure 3. 

According to a preferred embodiment of the invention, the nucleotide 
sequence encoding the human FATP at least comprises the f open-reading 
frame (ORF) corresponding to the human FATP represented on figure 2 or 
figure 5. Alternatively it comprises any -fragment of said ORF, coding for a 
polypeptide having the functional properties of the human FATP as far as the 
transport of the long-chain fatty acids is concerned. The coding sequence of the 
human FATP gene is described on figure 1 . figure 2 or figure Z. 
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Such a nucleotide sequence can be placed under the control of the 
native regulation sequences, or part thereof, present in the. gene and enabling 
the expression of hFATP or can be placed under the control of heterologous 
regulation sequences according to well-known procedures. ■* 

According to another embodiment, the invention relates to a nucleotide 
sequence hybridizing in stringent conditions, with a probe comprising 50 to 
2000 bp, preferably 50 to 300 nucleotides especially around 200 bp, said probe 
including at least 6, preferably 9 continuous nucleotides from the following 
sequence : CGGGGAGACGGGACGTGAAGGG; 

As an exiample, a sequence which is advantageously used to provide for 
a probe appropriate to selectively hybridize with the nucleotide sequence of the 
invention comprises at least some nucleotides contained in the 5'* non-coding 
sequence upstream from the ATG codon of the. coding sequence of the gene 
expressing the hFATP. 

A probe replying to this definition is considered to be specific for the 
human gene encoding the hFATP or derived genomic nucleotide sequences, 
expressing the human FATP. Thus, it does not hybridize with murine, rat or 
yeast cDNA sequences. Such a probe can be an amplification product resulting 
from amplification with primers wherein one primer is specific for the human 
hFATP gene, said primer comprising at least 6, preferably 9 nucleotides 
selected from the above nucleotide fragment. / * 

The second primer can be taken in any region of the gene and especially 
in the sequence comprising nucleotides^ to 2222 of the sequence of figure 1, 
or figure 3. ■ ., • 

The stringent hybridization conditions used according to the ^invention 
can be defined with respect to the following parameters, referring to 
hybridization of a DNA probe, especially a cDNA probe with a total RNA or 
DNA: ■.'■-■>*' ' 

- hybridization at 42°C, 

- 2 washing steps at 42°C for 10 minutes in 0.5 x SSC. followed by 2 
washing steps at 65°C for 30 minutes in 0.5 x SSG. ' 



WO 99/51740 



5 



PCT/EP99/02295 



Details regarding these conditions can be found in the examples. 

According to a specific embodiment of the invention, the nucleotide 
sequence is a genomic DNA sequence and especially is the gene encoding the 
human FATP, 

5 According to another embodiment of the invention, the nucleotide 

sequence comprises a transcription initiation site 61 bp upstream from the ATG 
codon. Especially, the nucleotide sequence comprises the sequence coding for 
the hFATP and the 5' region of the gene containing regulation sequences 
including the transcription initiation site. 

o The structure of the sequence of the gene between the ATG and TGA 

codons, can be represented as follows: 

794 
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ATG " ' TGA 

The respective size of each exons illustrated on figure 8 starting from position 1 



at the ATG codon is the following: 

exon 1 (167 pb), exon 2 (395 pb), exon 3 (162 pbj, exon 4 (70 pb), exon 5 (92 
pb), exon 6 (111 pb), exon 7 (209 pb), exon 8 (127pb), exon 9 (138 pb), exon 
10 (165 pb). exon 11 (147 pb), exon 12 (158 pb). 

The introns present between said exons, have respectively the following 
5' and 3' ends: intron 2 (5' gtgaggcc.gaccacag 3'); intron 3 (5' 
gtgagtca...tcctgcag 3*); intron 4 (5* gtgagggg... cccctgcag 3'); intron 5 
(5' gtactacg...ctctgcag 3'), intron 6 (...cgtcccccac 3'); intron 7 
(5* gtgcacacc.cattccag 3'); intron 8 (5* gtgagcag...ctccctag 3'); intron 9 (5' 
gtgcgcag...tctgccag 3'); intron 10 (5* gtcaagcL.gcctccag 3'); intron 11 (5* 
gtgcgagt...cactatag 3'). Intronic regions of a -hFATP gene according to the 
invention are represented on figure 4 which discloses the genomic sequence 
encoding the human FATP. 

Advantageously, the nucleotide sequence according to the above-given 
definitions comprises further sequences involved in the regulation of the 
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expression of the gene coding for the human FATP, present in the non coding 

5' and/or 3' regions of the gene. 

The sequences involved in the regulation of the expression of the gene 
include sequences containing the promoter region, the transcription initiation 
site and other regions involved in the activation of the expression or in 
enhancement thereof. 

Regulation of the expression also involves the regulation of the 
translation of the gene and the corresponding sequences. 

Variants of the above-defined nucleotide sequences comprise a 
sequence which either specifically hybridizes in high strigency conditions with 
primers having the following sequences primer 1 
AAGGTCAATGAGGACACAATGG (sense), primer 2 

CGAGTAGGTAGTGATCGTGCAG (antisense), or is the amplification product 
obtained with the following sequences, or hybridizes in high stringency 
conditions with said amplification product. 

Another variant of the sequence is a genomic sequence coding for 
hFATP and which hybridizes in high stringency conditions with a probe 
containing around 200 pb, said genomic nucleotide sequence preferably 
comprising a sequence involved in the regulation of the gene coding for the 
hFATP. 

A specific probe containing around 200 bp' which is capable of 
specifically hybridizing with a genomic sequence coding for FATP can for 
example be found in the sequence of exon 2. 

Another nucleotide sequence according to the invention is the mRNA 
which is a sequence obtainable by transcription of a genomic nucleotide 
sequence coding for the hFATP and replying to the above definition. 

The mRNA sequence obtainable from the gene coding for the hFATP 
- comprises the nucleotide sequence complementary to the sequence of figure 1 , 
or figure 3, having 2 kb. Preferably, it is a sequence including a nucleotide 
sequence of 4.4 kb capable of hybridizing in high stringency conditions with the 
sequence of figure 1 , figure 2 or figure 3. 
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The invention also specifically relates, to the cDNA as obtained by 
reverse transcription of the mRNA defined above. Such cDNA sequence 
comprises the sequence of figure 1 or figure 3 containing nucleotides 1 to 2222 

or the sequence of figure 2. 

The invention also concerns polynucleotides selected among the 

following sequences: 

- AAGGTCAATGAGGACACMTGG. 
• CGAGTAGGTAGTGATCGTGCAG, 

- a sequence comprising or corresponding to sequences involved in the 
regulation of the expression of the gene encoding the hFATP, 

- any fragment of a nucleotide sequence defined above especially 
derived from the sequences disclosed in figures 1, 3 or 4, for instance by 
deletion mutation or insertion provided the essential biological properties of the 
native sequences are maintained, specific for the hFATP gene, including 
fragments that can be used as primers for amplification reactions, or a probe for 
hybridization. The above cited biological properties are described in the 

examples which follow. 

A further object of the -invention is a vector, for the cloning or for the 
expression of a sequence defined above, said vector comprising, inserted in 
site not essential for its replication, a nucleotide sequence as defined above. 

Any appropriate regulation sequence, including heterologous sequences 
with respect to the hFATP gene, and especially nay appropriate promoter, can 
be used for the expression of the nucleotide sequence of the invention cloned 
into the vector. The selection of these regulation sequences can depend upon 
the cell host which is used to carry out this expression. 

Especially, the vector is a plasmid or a phage. Advantageously, it further 
comprises a reporter sequence such as the CAT gene or the sequence of the 
luciferase gene. 

The sequence of the reporter gene which is present in the vector can be 
under the control of an heterologous regulator region especially under the 
control of the regulation sequences of the gene encoding the hFATP. 
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The invention also relates to recombinant cells especially procaryotic 
cells or eucaryotic cells and advantageously insect or mammal cells. 

The invention is also directed to the human Fatty Acid Transport Protein 
comprising the amino acid sequence of figure ,2, or figure 5 or to any 
5 polypeptide fragment having the properties of FATP regarding intracellular 
transport of fatty acids, or to fragments recognized by antibodies directed 
against the human FATP. 

The hFATP has a calculated molecular weight of 71 kDa and an 
apparent molecular weight of 63 kDa in Northern Blot analysis. 
io According to a specific embodiment of the invention, the hFATP has an 

isoelectric point of 8.5 and replies to the amino acid sequence of figure 2 or 
figure 5.' According to another embodiment, this sequence is encoded by a 
nucleotide sequence as defined above. 

Advantageously, said human hFATP is processed and especially devoid 

15 of its signal peptide. 

The hFATP of the invention can be either under its glycosylated form or 

can be devoid of its glycosylate groups. 

The invention also relates to antibodies specifically directed against 

hFATP. 

20 In view of the various properties which have been identified with respect 

to the FATP in the regulation pathway of the long-chain fatty acid, the invention 
provides means that can be used in therapeutic compositions especially for 
modulation of the intracellular level of long-chain fatty acids, which comprises a 
component capable of regulating the expression of the gene encoding hFATP. 

25 Alternatively, the composition of the invention can be used for the 

therapeutic modulation of the blood level of long-chain fatty acids. 

' The inventors have especially found out that the involvement of human 
FATP in the modulation of the metabolism of fatty acids, especially long-chain 
fatty acids, can provide a way to treat pathological states related to various 

30 diseases and for example to obesity, cardiomyopathies and diabetes, 
specially diabetes non-insulino dependent. 
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According to the invention, the expression "treatment" relates to the 
capacity of a composition or compound, to prevent the occurrence of a 
pathological state or to control such pathological state or to improve the 
condition of a patient suffering from a pathological state, where this pathological 
state is the result or involves an abnormal regulation of the long-chain fatty 
acids metabolism. 

Especially, the invention relates to such a composition which comprises 
an inhibitor of the expression of the FATP gene, for instance for the treatment 
of obesity or related diseases. V. 

Alternatively, the invention relates to a composition which is capable of 
enhancing the expression of the hFATP gene for the treatment of 
cardiomyopathies, diabetes or related diseases. 

The invention further provides means enabling assaying the capacity of 
chosen compounds to have an agonist or antagonist activity with respect to the 
expression of FATP, especially human FATP. in various tissues. Such assay 
can hence comprise the steps of: 

i contacting a culture of cells of a specific tissue, said cells containing the 
hFATP gene, with a tested compound; 

- detecting the effect of said compound on the expression of the FATP 

20 gene. 

The invention also relates to a method for the screening of the 
expression of the hFATP protein in determined cells when these cells have 
been contacted with a determined compound, which method comprises the 
step of detecting the transcription of the mRNA in the cells or cell extracts. 
25 In a specific embodiment of the invention, the screening method 

comprises 

a) measuring the level of transcription of the mRNA in cells or cell 
extracts, wherein the cells have previously been contacted with the determined 
compound, in conditions enabling the interaction of said cells and said 

30 determined compound; . 

b) measuring the level of transcription of the mRNA in the same cells 
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species as in step a) or on extracts of these cells, wher in these cells have not 
been previously contacted with the assayed compound; 

c) comparing the level of transcription obtained in steps a) and b). 
\ Preferably the cells which are, used to carry out the above screening 
5 methods are selected among the group of live cells, heart cells, adipose tissue 
cells, skeletal muscle cells. The above defined methods can allow the 
identification of agonists or antagonists of the expression of the hFATP gene. 

' ? Specific elements of the invention are further disclosed in the examples 
and in the figures: 

10 Figure 1: nucleotide sequence of clones containing 2 kb sequences obtained 
from' a cDNA library of human adipose tissue. The sequence on the upper line 
corresponds to the sequence encoding the hFATP protein and the sequence on 
the bottom line corresponds to the alternative form of splicing found in the 
analyzed clones 

is •' - - . ■ ; • ' ■ • ■ 

Figure 2: Alignment , of the nucleotide sequences of hFATPt, mFATP 
(MMU15976) and rFATP (RNU89529) and of the primary amino acid sequence, 
using the J. Hein method. The glycqsylation sites are indicated by an asterisk 
and are boxed. The sequence of 1 1 amino acids common to the members of 

20 the family of AMP-binding proteins is boxed on the alignments. 

□ Amino acid different from the murine sequence 



Consensus sequence of 11 amino acids characterizing the family of 
AMP-binding proteins 

25 □ Potential glycosylation site. 

Figure 3: a variant of the nucleotide sequence of clones encoding the human 
FATP protein and obtained from a cDNA library of human adipose tissue. 

30 Figure 4: sequence of the genomic sequence expressing the human FATP 
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protein, including the introns (lower case) and the exons (upper case). 

Figure 5: The amino acid sequence encoded by the nucleotide sequence of 
figure 3 and corresponding to the human FATP protein. 

5" 

Figure 6: Comparative analysis based on the Garnier-Robson, Chou-Fasman 
and Kyte-Doolittle algorithms of the murine and human FATP proteins. 

Figure 7: Primer extension with the reverse GM6 primer on 10 ug of human 
10 visceral adipose tissue total RNA. 

Figure 8: Provisional genomic structure of the hFATP gene. A. Genomic 
structure of hFATP with its 12 exons and 11 introns. The numbers in the boxes 
surrounding the exons indicate the exon numbers. The numbers above the 

15 boxes indicate the position of the introns. B. The approximate size of introns 2 
to 11 (bp) is indicated between the dashes separating the two ends of the 
donor and of the recipient. Intron 6 was identified by sequencing the PAC 
clone. C. Alternative splicing leading to hFATP 1 or its alternative splicing 
product hFATP. The relevant intron is intron 1 . hFATP 1 is normally spliced as in 

20 . mice. The splicing of the variant of hFATPI resumes beyond the intron in the 
coding sequence but changes the resading frame. 

Figure 9. The 19p1 3 region. The arrow indicates the position of hFATP on 
chromosome 19, the shaded zone represents the frequency of the genes 
25 encountered in the different regions of the chromosome. The arm p of the 
chromposome is represented at the top. The region q is represented at the 
bottom. ,'•< 

Figure 10. A. Dose response of the effets of 9c-RA on FATP and ACS gene 
30 expression on FAO cells 

Cells were kept 18 hr in serum-free medium and treated for 6 hr with increasing 
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concentrations of 9c-RA (10* 8 M to 10" 4 M). Control cells were incubated with 
BSA and vehicle. 30 pg of total RNA were loaded per lane. Northern blot 
analysis was carried out and blots were hybridized with FATP, ACS, and actin 
* probes. Quantification was performed with an imaging densitometer (Biorad 
5 GS-670). Values were normalized by comparison with actin control probe and 
results are expressed as percent of control signal, relative arbitrary unit 
(RAIL). 

figure 10.B. Dose response of the effects of 9c-RA and at-RA on FATP gene 
10 expression on Hep-G2 cells. 

Cells were kept 18 hr in serum-free medium and treated for 6 hr with (10 " 8 M to 
10" 4 M) either 9c-RA and at-RA. Controls were incubated with BSA and vehicle. 
Northern blot analysis was carried out and blots were hybridized with an FATP 
and actin probe.- 

* 15 

Figure 1 1 . A. Dose response of 9c RA on differentiated 3T3-L1 cells. 
Differentiated 3T3-L1 cells were treated 24 hr with different concentrations 9c- 
RA (0-1 O^M). Northern blpt analysis was carried out and blots were hybridized 
with an FATP and ACS probe. Actin was used as control probe. A means 
20 differentiated. 

Figure 1 1 .B. Comparative induction of FATP and ACS in FAO and Caco2 cells. 
FAO(A) and CaCo2 (B) cells were serum deprived 18 hr and then treated 
during 6 hr with 9c-RA (10" 6 M). Northern blot analysis was carried out as 
25 described and blots were hybridized with an FATP, ACS and actin probe. 

Figure 12. Nuclear run on analysis of the effect of 9 cisRA on FAO and 3T3-L1 
differentiated cells. 

FAO cells were treated for 2 h with 9c-RA (10" 6 M) after 1 8 h of growth in serum 
30 deprived conditions. Differentiated 3T3-L1 cells were treated during 12 h with 
the same concentration of 9c-RA. Transcription rates were then determined for 
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the FATP, ACS and GAPDH genes in from control cells (-RA) or 9c-RA treated 
cells (+RA). A Bluescript (BS) template was used as a control. Densitometric 
scanning of the results is depicted at the right panel. , 

5 Figures 13, 14 and 15 

Dose-response of 9c-RA, at-RA and TTNPB and: in non-differentiated and 
differentiated 3T3-L1 cells. 

Differentiated (3T3-L1 A) or non-differentiated (3T3-L1) cells were treated 24 hr 
with each of the retinoids indicated. Northern blot analysis was carried out and 
10 blots were hybridized with an FATP (figure 13), ACS (figure 14) and LPL (figure 
15) probe. Values were normalized by comparison with actin control probe and 

results are expressed in percent of control (R.A.U.). 

t- 

Figure 16. Oleate uptake assay on 3T3-L1 differentiated cells, 
is Differentiated cells 3T3-L1 (3T3-L1 A) were treated for 24 hr with different , 
concentrations of TTNPB (1 0" 9 M to 1 0* 5 M) and 14 C oleate uptake studies were 
carried out. Results were normalized after protein quantification and expressed 
as % of control. Significant differences were indicated by an asterisk * P<0,05. 

20 Figure 17. Tissue expression of FATP in different human tissues. The human 
probe of the li-actin has enabled checking the regularity of the depots. The 
additional band obtained with this probe represents the actin specific of the 
muscle which is thus highly expressed in muscle and brain. 

25 Figure 18. Expression of FATP in different human cell lines and comparison 
with the NCI adipose tissue: surrenal cortex carcinoma cells, THP1: monocytes, 
THP1 diff: THP1 differentiated with . PMA, Caco2: cells of colon 
adenocarcinoma, HepG2: liver hepatoma cells, Hep3B; hepatocytes, JEG-3: , 
chorion carcinoma cells. 

30 • \ " * " '. \ • 

Figure 19. Distribution of FATP and LPL in muscular skeletal tissue, adipose 
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tissue, liver,, colon, and intestine tissues in human. Sc: -sub-cutaneous, Vis: 
visceral. » 

1 ♦ . ■ > ' -. 1 ■ - . ' . 

Exemple 1 

1". MATERIALS AND METHODS ^ ^ 
1. Materials 

BRL 49653 and fenofibric acid were provided by Janssen Research 

Foundation, in Beerse, in Belgium, by the Fournier laboratories, in Daix, in 

10 France and by Ligand Pharmaceuticals , in San Diego, in the United States. The - 

retinoic acids 9rQis-[etinoic acid (9cRA), ajl-irans-cetinoic acid (at-RA), were 

provided by Ligand Pharmaceuticals. All the other products, unless otherwise 

stated, were obtained from Sigma (St Louis, MO). 

..■<*? • ■ < ■' ■ ■ 

- ■ ■» . 

15 2. Human biopsies 

The various omenta! and subcutaneous human adipose tissues were, 
collected * from normal or obese subjects during plastic or reconstructive 
operations. The muscular tissues were collected from patients who had* 
undergone an operation in the hip region. They were the liver or skeleton 

20 muscle tissue. All the tissues were immediately frozen in liquid nitrogen. 

3. Animals and treatments 

Male rats of the Wistar strain were treated for various lengths of time with 
fenofibrate mixed with the feed in powdered form (weight/weight) or BRL 49653 

25 by gavage, at the concentrations indicated. The weight of the animals and the 
intake of feed were recorded daily for the 7-day treatments and every two days 
for the 14-day treatments/ the treatments with the fibrates did not cause 
significant changes in the quantity of fee consumed by the animals. Since each 
rat consumes approximately 20 grams of feed per day, the doses of 0.5, 0.05 

30 and 0.005% (weight/weight) of fenofibrate correspond to 320, 32 and 3.2 mg/kg 
of body weight/day. At the end of the treatments, the rats were sacrificed by 
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exsanguination under ether anaesthetic/ The triglyceride and cholesterol levels 
were measured using a colorimetric detection kit (Boehringer-Mannheim). The 
various tissues were collected, rinsed in 0.9% NaCI and immediately frozen in 
liquid nitrogen. The liver and the epididymal adipose tissue were weighed. The 
5 effects of starvation were studied on rats from which food had been withdrawn 
. 14 hours before being sacrificed. 

4. Cell culture • v. 

The cells of the different hepatic lines used were: Fa 32, a rat hepatoma 

10 cell line derived from Faza967 (Deschatrette et a!., 1974), the mouse AML-12 
hepatocytes (Wu et al., 1994), the human hepatoma line HepG2 and the FAO 
cells, a well differentiated subclone of the rat hepatoma line H4 II EC3. All the 
cells were maintained in Dulbecco's modified Eagle's minimal essential medium 
(DMEM), supplemented with 1 0% decomplementized foetal calf serum; except 

15 AML-12 and FAO. The AML-12 cells were maintained in DMEM/Ham's F-12 
medium supplemented with insulin, transferrin and selenium (ITS,. Collaborative 
Research), and, dexamethasone (0.1 pM). The FAO cells were cultured in 
DMEM/Ham's F-12 supplemented with 10% decomplementized serum, 
penicillin (200 lU/ml) and streptomycin (50 mg/ml). 

20 The mouse* preadipocyte cell lines ob1771 (Negrel et al., 1978) and 3T3- 

L1 (ATCC) were maintained in DMEM medium with 10%* lipid-free and 
decomplementized foetal calf serum. The 3J3-L1 cells were differentiated by a 
two-day treatment with dexamethasone (0.1 pM), isobutylmethylxanthine (0.25 
mM) and insulin (0.4 pM). These cells, following initiation of differentiation, were 

25 maintained for a further 8 days with insulin until their complete differentiation 
was obtained. , 

The L6 muscle cells were cultured in DMEM with 10% foetal calf serum 
up to confluence followed by>spontaneous differentiation. 

The Caco2 colon carcinoma cells (ATCC) were cultured in DMEM 

;>o medium, and 15% decomplementized foetal calf serum as well as antibiotics 
and non-essential amino acids. Above 80% confluence, the cells differentiate 
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spontaneously for 20 days. ' * ^ 

5. RNA analysis - . \ 

5.1. Extraction of RNA and quantitative analysis 

The RNAs of the tissues and cells were prepared by the technique using 
caesium chloride and guanidine isothiocyanate (Chomeczynski et al., 1987). 
For the adipose tissue, a first step of centrifugation at 4°C of the 
homogenization product was added in order to remove the lipids, which are 
solidified at the surface, and to continue the extraction under * the best 
-* conditions. The dot-blot and Northern-blot hybridizations of the total or 
messenger RNAs after separation on a poly A column (Stratagene) were 
carried out as described in Auwerx et al. (Auwerx etaL, 1988). Before being 
deposited, the RNAs were assayed in duplicate by spectrophotometry 
(Pharmacia Ultrospec 2000), the difference tolerated between the values was 
less than 5%. The various messengers were measured using cDNA fragments 
obtained by reverse transcription, rt-PCR or by restriction, as probes. The 
probes were labelled with a random primed labelling kit (Boehringer-Mannheim) 
and a- 32 P]dCTP (NEN, Boston, MA). The filters (nylon and nitrocellulose, Pall 
Filtron) were hybridized overnight at 42°C with 10 6 cpm/ml. The washes were 
carried out for '10 minutes at 42°C and for twice 30 minutes at 65°C in a v 0.5 X 
SSC buffer and 0.1 % SDS. Next, these blots were exposed : to 
autoradiographic films (X-Omat AR or Biomax MS, Kodak). The autoradiograms 
were analysed by densitometric scanning (Biorad GS 670 densitometer). 

5.2 Measurement of the transcriptional activity by run on or transcription in vitro 
The nuclei of the cells and of the tissues were prepared according to the 
Nevins technique (Nevins, 1987). The total RNAs labelled with ct- 32 P]UTP 
(3000 Ci/mmol) (NEN, Boston, MA), in quantity of equivalent radioactivity, were 
hybridized on membranes on which there had been deposited 5 pg of the 
different plasmids or 1 pg of the restriction fragments of the genes of interest. 
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5,3. Primer extension 

The location of the site of. initiation of hFATP was determined as 
described in Sambrobk et al. (Sambrook et al., 1989). An antisens 
oligonucleotide, starting at +6 bases from the site of initiation of translation, was 
used 3'-CCC GCA TCC CTT CAC GTC CCG TCT CCC-5', 10 jjg of total RNA 
of human adipose tissue were precipitated with 500,000 cpm of primer labelled 
in 5' using T4 polynucleotide kinase (Amersham, Courtaboeuf, France with y- 
32 P]ATP, denatured for 5 minutes at 95 °C and annealed for 90 minutes at*65°C 
, in a hybridization solution. The extension was carried out at 42°C for one hour 
with 50 U AMV-RT (Life Technologies, Paisley, GB) and 100 U MMLV-RT (Life 
Technologies, Paisley, GB). A control sequence reaction was used as standard 
.mass marker to locate the 5' end of the extension product. ■ 

6. DNA techniques 

6.1 . Screening of complementary DNA libraries and of genomic DNA of phages 
Three libraries were" used to isolate the FATP gene, including a 
commercial cDNA library (Human fat celt 5'stretch plus cDNA library HL3016b, 
oligod'1-primed, Clontech laboratories, Inc. USA). After titration of the libraries, 
10 6 plaque forming units (PFU) served for the infection of 6 ml of the respective 
competent bacteria (Y1090,; XL-1 and SRB) OD 600 =1, maintained in Luria 
Bertani medium with 0.2% of maltose and 10 mM MgS0 4 . The adhesion , of the 
phages to the bacterial walls was carried out for 15 minutes at 37°C in 14-ml 
polypropylene tubes. The mixture was then spread with TOP agar (£& amine, 
casein hydrolysate, yeast extract (NZY) + 0.7% agarose) on 10 culture dishes 
15 cm in diameter. The lysis was carried out overnight at 37°C. Two bfottings 
on nitrocellulose filters (NEN, Boston, MA) were carried out on each dish, taking 
care to note the orientation of the dishes. The filters were then, treated in a 
denaturing, neutralizing and washing solution, and then fixed under UV or with 
heat. The hybridization, washing .and exposure conditions were the same as 
those used for the RNA techniques: In each screening, a double-stranded 
probe was used, derived from, rt-PCR or from restriction. The probe was for 
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example a 652 pb fragment, amplified with the primer sense. GM1 (ou 382) (5- 
ATG CGG GCT CCT GGA GCA GGA ACA -3') and antisense GM2 (ou 399) 
(3'-CTG CGT GTC AGG CAG GAT GCT TCT AGG CCC-5') covering the 5' end 
of mFATP cDNA. The clones isolated from a first screening were then purified 
by two additional screenings. A final step consisted in amplifying the titre of the 
phage clone isolated. - 

6.2. Subcloning of the hFATP phage clones 

The phage DNA was then prepared according to the Sambrook , 
technique (Sambrook et al., 1989) or amplified by PCR with a- 'high fidelity 
polymerase (Boehringer-Mannheim), and then mapped and excised at the level 
of the cloning site in the phage in order to then introduce it into a bluespript 
(BS) type vector, pBS-KS. The inserts could thus be amplified, sequenced, 
mapped and manipulated more easily. 

6.3. Sequencing and alignment of sequences 

The first clones were sequenced manually with the T7 sequencing kit 
(Pharmacia) or on a PCR product after treating the inserts with a phosphatase 
and an exonuclease I in order to remove the primers and the dinucleotides 
remaining whch interfere with the sequencing reaction. The product 'of 
sequencing was then deposited on a 6 or 7% acrylamide gel, the migration 
lasted 3-4 h at 60 mA. The sequence was read on the autoradiogram and could 
be as high as 200 bp. 

The sequences were then prepared on an automatic sequencer (ABI 
377, Perkin Elmer) with a PCR sequencing reaction using fluorescent 
terminators (ABI Prism big dye terminator, cycle sequencing ready reaction kit, 
Perkin Elmer Biosystem), the reading is automatic and may be as high as 700 
bp. The sequence alignments and searches were carried out in a first instance 
with Genbank by means of the Lasergene Navigator software (DNA Star) and 
then in a second stage using the Internet on a national center for biotechnology 
information (NCBI) server. The search for homology, for the cDNA fragments 
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was carried out using the Blast search programme. The online mendelian 
inheritance in man (OMIM) searches made it possible to identify the genes 
located on the same chromosome as hFATP. 

6.4. Search for introns 

The search for the intron/exon was carried out by PCR amplification on 
genomic DNA by comparison with amplifications on cDNA (phage clone, 
plasmid) or rt-PCR. Direct sequencing on plasmid artificial chromosome (PAC) 
(Genome systems Inc., St. Louis, USA) were undertaken. The junctions were 
identified after alignment of the sequences. 

7. Measurement of the transport of oleate 

7.1. Preparation of the fatty acids 

The labeled fatty acids ( 14 C oleate, 40-60 . mCi/mmol, NENE) are 
incubated in the presence of albumin in a ratio of between 0.5 and 2 for 45 
minutes at 37°C in 1X Hank's solution. 

7.2. Measurement of transport 

The cells treated with the various activators are rinsed with a Hank's 
solution and then incubated for 1 h with serum-free and glucose-free medium. 
The cells were again rinsed with a 1X Hank's solution with 0.2% of bovine 
serum albumin (BSA) at 37°C and then at 23°C, the transport being measured 
at room temperature. The incubation of the cells with labeled oleate lasts for 1 
minute, the stop solution was an ice-cold Hank's solution. The cells were lysed 
with 0.1% SDS, the proteins were assayed on 20pl, the remainder was 
supplemented with 4 ml of scintillation liquid and counted. 

8. Chromosomal location 

The FATP probe was labelled with digoxigenine-11-dUTP by nick 
translation (Boehringer-Mannheim). The probe was purified and concentrated 
by precipitation in order to obtain a concentration of 50 ng/(jl. The chromosomal 
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preparation was obtained from t lymphocytes stimulated with 
phytohaemaglutinin. The cells thus stimulated were synchronized after 48 h of 
culture by the action of methotrexate for 17 h and then the cell cycle was' 
unblocked by the addition of thymidine. After 4h 30 min of action, the cells were 
blocked in the metaphase by the action of colcmid. After a hypotonic shock and 
fixing, the preparation was deposited on a slide; to obtain good precision, the 
techniques of G bands, obtained by the action of trypsin and after Giemsa 
staining (GTG banding) (Francke et al., 1978) and the FISH technique were 
combined. The metaphases were then selected on a DMRXA microscope 
(Leica). The were karyotyped and stored in a software for image analysis 
(CHANTAL) developed by Leica. After non-isotopic in situ hybridization (Geffroy 
et al.. 1995) with the FATP probe and , detection, the metaphases were 
examined with the aid of a fluorescence microscope (Leica type* . DMRXA) 
combined with a Leica image analyzing system. The software allowed 
superposition of the metaphase in the G band and in FISH. 

2. ISOLATION AND CHARACTERIZATION OF THE HUMAN FATP GENE 
1 . Results 

1.1 . Isolation of the complete cDNA from the human gene for FATP 
1.1.1. Purification of the hFATP clones 

Several human adipose tissue cDNA libraries, (including one commercial 
human fat cell 5' stretch plus cDNA library HL3046b, oligodT primed, Clontech 
library) were screened with a murine probe in order to isolate various cDNA 
clones of the human FATP gene. The first screening was carried out on the 
commercially made human adipose tissue cDNA library (Clontech). The probe 
used was a 5' fragment of 654 bp (position 1-654) amplified by rt-PCR on 
mouse adipose tissue RNA with primers 382 (5'-ATG CGG GCT CCT GGA 
GCA GGA ACA-3') and 399 (3'-CTG CGT GTC AGG CAG GAT GCT TCT AGG 
CCC-5') or alternatively (GTG TCA GGC AGG CAG GAT GCT CTC). This 
probe was inserted into a bluescript plasmid, pBS-KS, at the level of the EcoRV 
site and the sequence of this plasmid, mFATP399, revealed 100% homology 
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with the murine sequence. The screening provided 23 positive clones from 10 6 
PFU, the probability of isolating a gene having a normal frequency being 1 out 
of 1. million. Next, a second screening was undertaken on another library under 
the same conditions for titration and spreading , of the phages* and for 
hybridization. This screening provided 6 clones which were identified as 
containing FATP. These clones were purified over three successive screenings. 

• The cDNA ' inserts, excised after digestion with EcoRI, provided 
fragments in/a size interval from 300 bp to 2 kb. These clones were amplified 
by PCR using the lambda phage gt1 1 primers (X. gt1 1 Forward and X gt 1 1 
Reverse). The sequencing of these PCR products allowed us to differentiate 1) 
the clones whose DNA sequences were homologous to mFATP\ they are the 
partial hFATP clones, 2) the clones whose .sequence was overlapping or even 
identical and 3) the. clones whose DNA sequence did not align with mFATP. 
The latter category comprises false-positives which proved positive in 
hybridization because of homology with the probe used but whose sequence c 
differentiates them from hFATP. They may be genes already identified or open 
reading frames (ORF) not yet identified. * 

Three large inserts, homologous to mFATP (1.6 kb clone 2.a, 2 kb,,clone 
3.e and 1 kb, clone 3.g starting at nucleotide +84 following the ATG codon), 
were subcloned to a Bluescript vector at the level of the EcoRI site and 
completely sequenced over the sense and antisense part. These fragments 
were then aligned with the mouse sequence which, was, with the rat sequence 
rFATP, highlythomologous, the only two sequences known from 1994 to the 
end of 1997. The 1 .6 kb clone aligned from position 487 on mice up to position 
1941 which corresponds to the TGA codon, that is to say the end of the murine 
coding part. This clone therefore had an additional 3* part (147 bp) which was 
identified by rt-PCR as the 3' untranslated, transcriptional region (3'UTR) and of 
the FA TP gene, a region which is transcribed but not translated. \ ; 

The 2 kb clones appeared to be identical during total amplification of ! 
these 2 kb with the Xgt1 1 F and >.gt11R primers, by PCR. The sequencing of 
these clones made it possible to differentiate them: hFATPI and an alternative 
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.form of splicing. The open reading frame of this clone is 1941 bp for hFATPI, 
that is to say identical to the murine coding sequence of 1941 bp. Its alternative 
form of splicing had an 82-base deletion compared with hFATPI. This deletion 
is situated from position 168 to 249 on the mouse sequence or Xhe^hFATPI. 
5 sequence. This deletion could result from an alternative splicing, which was 
^ thus analyzed with the genomic structure: These two clones had 22 bases of 
5'UTR and 265 bases of 3'UTR. . ( 

1.1.2. Characteristics of the clones 

10 A search in the NCBI library revealed 84% similarity at the nucleotide 

level between hFATPI and MMU15976 (murine FA TP sequence identification 
name) and 83% between hFATPI and RNU89529. (rat FATP sequence 
identification name). The murine FATP sequence exhibits 93% at the 
nucleotide level with the rFATP sequence. A search for sequence homology 

is with the entire library of genes listed by NCBI was carried out with the Blast 
search program: This analysis revealed 57% homology at the nucleotide level 
with the cDNA for rat acyl coenzyme A synthetase (D851 00) with an alignment 
over 351 bases, 58% with the murine hbmologue of ACS (AF33031) with an, 
alignment over 348 bases. The hFA TP1 sequence also exhibited 94% identity 

20 over 18 bases (D88308) and 56% over 210 bases with the human homologue 
of ACS. The center of this region of homology is indicated in Figure 2. 

At the amino, acid level, the protein exhibits 89% homology with the^ 
murine protein, .mFATP, and 89% with the rat protein rFATP/ rFATP and 
mFATP have 93% homology. Yeast Fat,1P had, for its part, only 54% 

25 homology with mFATP. -\ 

The hFATP protein comprises 646 amino acids, its calculated molecular 
weight is 71 kDa and its isoelectric point is 8.5. We compared the amino acid 
sequence of hFATPI and from its 1 alternative splicing form. Said alternative, 
from encodes a protein of 57 amino acids and ends with an ambre mutation 

30 which stops translation. By observing the translation in the three phases of the 
complete sequence, we noticed that the sequence following this termination 
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site aligned farther away in another reading frame with hFATPL Such a 
phenomenon can only be due to a splicing error which can be confirmed by 
analyzing the genomic structure. 

Analysis of the composition of the protein by the DNA Star software 
5 shows 40% of hydrophobic amino acids, which is in agreement with the fact 
that FATP is a transmembrane protein. hFATP has a signal sequence of 30 
amino acids (von Heijne, 1986). The amino acid sequence of hFATP shows 
three potential glycosylation sites (N-X-S/N-X-T), these sites are conserved in 
mice, rats and humans. Only one of these sites is identical to the yeast Fat 1P 

10 protein whose amino acid sequence was published in 1997 (Faergeman et al., 
1997). In a very advantageous manner, a sequence of eleven amino acids is 
perfectly identical and conserved between hFATP, mFATP, rFATP, Fat 1 and 
rACS. This oligopeptide (IYTSGTTGLPK) is common to members of the family 
of AMP-binding proteins. On the basis of this sequence conservation, the 

is common evolutionary line for the FATP; Fat 1P and ACS genes can be 
confirmed. 

The structural units characterizing the secondary structure of the FATP 
protein was studied by comparing its protein analysis using the protean 
program of the DNA Star software (Figure 6). « 

20 Using the kyte and Doolittle algorithms, the hydrophilicity profiles of 

hFATP and mFATP was compared (kyte et al., 1982). An analysis of the 
primary amino acid sequence based on the Chou-Fasman calculations was 
then carried out in order to locate the a and p sheets,, and the bent and helical 
regions (Chou et al., 1974). This study was complete with a Garnier-Robson 

25 analysis locating the same regions but with a different method (Gamier et al., 
1978). These analyses demonstrated that these proteins exhibited comparable 
profiles. 

The 3VJR regjon does not. contain putative polyadenylation sites or 
destabilizing consensus sequence (ATTTA), which suggests that the complete 
30 message is much longer than 2 kb. This hypothesis was confirmed further to a 
commercial Northern-blot hybridization with a human FA TP probe. The size of 
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the messenger is approximately 4.4 kb. ' 

1 .1 .3 .Identification of the site of initiation of transcription 
• k . In order to identify the site of initiation of transcription, a 5' extension, 
5 primer extension, with an antisense primer (reverse GM6) positioned in 5' of the 
coding region was carried out. The number of bases between the primer and 
the extension product was 34 bases, the primer was 27 bases long, which 
locates the site of initiation 61 bases upstream of the ATG. This distance is 
relatively short compared with the average distances of the sites of initiation 
10 which, in addition, may be highly variable. The location was . checked with a 
rapid amplification ofcDNA ends (RACE) technique. 

1 .2. Isolation of the human FATP gene and genomic characterization * 
1.2.1. Isolation of the human genomic DNA clone 1 

In the second instance, a human genomic DNA phage library was 
hybridized with a human probe and 6 genomic clones were isolated from 10 6 
PFU. The genomic characterization of human FATP was carried out by CPR on 
the genomic clones and total genomic DNA. ?' ' ■ 

A PAC clone, derived from a plasmid capable of inserting fragments up 
to 120 kb was used. This clone was obtained using two primers, the first 
located from nucleotide 1267 to 1288 (GM3) and' the second from nucleotide 
1495 to 174 or more preferably' from nucleotide 1469 to 1489 (GM4) in the 
sequence. The screening was carried out by PCR on human genomic DNA. 
The determination of the introns on this PAC was carried out by automatic 
sequencing using a sequencing technique developed for long DNA fragments 
(Fajas et al., 1997). Unlike the genomic clones previously isolated, the 
PAC14957 clone did not contain the pseudogene. Indeed, the primers used 
were intended to amplify the region which contained an intrbn, which made it 
possible to avoid again isolating the pseudogene. For the remainder of the 
characterization/this PAC 14957 clone arid the human total genomic DNA being 
checked were used. 
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1.2.2. Structure of the gene 

On this PAC clone, the introns situated at the 3* end of the FATP gene 
up to position 997 were identified. Beyond this region, towards the 5* end, no 
5 alignment could be achieved on more than 3,000 bp sequenced on a PAC 
14957 clone by the technique of DNA walking towards the 5' region. 
Furthermore, this PAC clone did not appear to hybridize with oligonucleotides 
having the 5' part of the cDNA. In order to determine the positions of the other 
introns, a PCR strategy was applied to human genomic DNA in comparison 
10 with the rt-PCRs carried out on the human adipose tissue RNA (Figure 8). In 
order, the size of the exons is: 167, 395, 162, 70, 92, 111, 209, 127, 138, 165, 
147 and 158 bp. 

1 .3. Chromosomal location of FATP 

15 With the aim of determining the chromosomal location of hFATP, an in 

situ hybridization technique was, used. Twenty interpretable metaphases, 
following the chromosomal preparation of T lymphocytes, 1 were selected for 
hybridization with the hFATP probe. The in situ hybridization with the 
PAC14957 probe, a genomic clone of hFATP, labelled with digoxigenin, was 

20 located on chromosome 19, more precisely 19p13.1. 

This region is very rich and 126 genes, or locus responsible for disease 
were located jn this region. Among the genes of greatest interest, at the level of 
the lipid and carbohydrate metabolism, situated in. the same 19p13 region, 
there are: the familial hypercholesterolaemic locus, the insulin receptor, the 

25 atherogenic lipoprotein phenotype, apolipoproteia C-l and the locus involved in 
apoC-l I deficiency. 

2. Discussion 

The human homologue of mFATP was isolated by screening a human 
30 adipose tissue cDNA library. Two messengers were differentiated: hFATPI 
which encodes a protein of 646 amino acids and hFATP2, the truncated form of 
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hFATPI, which encodes a protein of 57 amino acids. The functionality of 
hFATP2 is very unlikely and is derived from an alternative splicing. hFATPI 
comprises 12 exons and 11 introns. 

The hFATPI protein is hydrophobic in nature and its protein profile is 
very similar to that of mFATP; hFATPI is a hydrophobic, transmembrane 
protein with three glycosylate sites and a potential signal sequence of 30 
amino acids. Analysis of sequence homologies with other proteins made it 
possible to reveal 11 amino acids which are perfectly identical between ACS 
and FATP. These amino acids would constitute a' binding site common to these 
two proteins. Shaffer and Lodish had already suggested that FATP and ACS 
could act in concert; the inventors have now shown analysis of gene regulation 
that FATP and ACS are regulated in a coordinated manner in tissues sensitive 
to insulin. FATP is thought to allow the entry of fatty acids into the cell and ACS 
is thought to convert them to active metabolic compounds, the acyl CoAs, for 
storing energy as a reserve or for the production of energy. 

The location of FATP gene on chromosome 19, more precisely 19p13.1, 
is of interest since this region comprises numerous genes involved in the 
metabolism of fatty acids: the locus for apo C-ll deficiency, the metabolism of 
glucose, the insulin receptor and the metabolism of cholesterol with the locus 
for familial hypercholesterolemia. FATP is thought to form part of a syntenic 
group with one of these other genes and to segregate in the same manner. 
Thus, ap C-ll deficiency causes a decrease in the LPL activity and is 
accompanied by accumulation of particles high in triglycerides. The insulin 
receptor has been very widely studied in order to search for the causes of 
insulin-dependent diabetes, an abnormality in this receptor prevents the action 
of endogenous and exogenous insulin. Familial hypercholesterolaemia due to a 
mutation in the receptor for the LDL particles causes the development of 
atherosclerosis and of cardiovascular diseases. All these manifestations have 
in common an accumulation of lipids in the bloodstream which may lead to 
insulin resistance. 
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Example 2 

INDUCTION OF THE FATP GENE BY DIMER SELECTIVE RETINOIDS 
SUGGESTS THAT PPAR-RXR HETERODIMER IS ITS MOLECULAR 
TARGET. 

, Free fatty acids can be released from adipocytes by the hormone- 
sensitive lipase (HSL) or from tripglycerides-rich lipoproteins by lipoprotein 
lipase (LPL). Circulating fatty acids can then cross the plasma membrane either 
by virtue of their lipid solubility (Higgins, 1994) or be taken up by cells in a 
process mediated by. the fatty acid transport protein (FATP). A second protein 
found to be involved in the process of long-chain fatty acid uptake is the Acyl* 
Coenzyme A synthetase (ACS). FATP acts as a transporter of fatty acids, 
whereas the role of ACS is rather confined in preventing the efflux of fatty acids 
through an esterification process. . 

FATP and ACS mRNA levels are regulated in a tissue-specific manner 
by peroxysome proliferator-activated receptors (PPARs) (Martin et al. f 1997). 
PPARs are members of the nuclear receptor gene superfamily. Expression of 
LPL, FATP and ACS is known to be under the control of hypolipidemic and 
hypoglycemic drugs, an effect mediated by PPARs. Three retinoic acid 
receptors, termed RARa, -p and -y, and three retinoid X receptor, designated 
RXRa, -p and -y, are classically thought to transduce the effects of retinoic acid 
(RA) on gene expression. Both 9-c/s RA (9c-RA) and all trans-RA (at-RA) can 
directly bind and activate RARs, whereas RXR doesn't bind at-RA, but binds 
9c-RA. RXR-PPAR heterodimers respond to both RXR and PPAR ligands 
(Kliewer et al., 1992). 

RXR agonists function as insulin' sensitizers and have beneficial effects 
on hypertiglyceridemia, hyperglycemia and hyperinsulinemia in mouse models 
of NIDDM and obesity (Mukherjee et al., 1997). This effect is not secondary to a 
variation in the weight of the animals. Furthermore, some of these rexinoids 
allow to determine which heterodimer is specifically involved in gene regulation 



WO 99/51740 PCT/EP99/02295 

28 

and hence these ligands provide a new tool for a better understanding of RXR 
action and the development of new pharmacological compounds. 

Because thiazolidinediones, which are PPARy specific ligands, are 
thought to exert part of their antidiabetic effect through an alteration of the fatty 
acid partitioning and activation of PPARy-RXR, the inventors proposed that 
RXR ligands could have similar effects and be potential activators of FATP, 
LPL, and ACS expression. LPL, FATP, and ACS are involved in fatty acid 
partitioning and the levels of circulating free fatty acids depend oh the 
expression of these genes. PPARy activators improve glucose homeostasis 
and this effect may be due to the regulation of LPL (Schoohjans et al., 1996), 
FATP and ACS target genes. In order to confirm the implication of PPAR-RXR 
heterodimer in the improvement of glucose homeostasis, the regulation of LPL, 
FATP, and ACS genes by retinoic acid was studied. This allowed to determine 
which dimer was*involved in this process and provided a better understanding 
of the regulation of lipid uptake by nuclear receptors: The PPAR-RXR 
heterodimer as the molecular target mediating these effects on fatty acid 
partitioning leading to an improvement of insulin sensitivity. 

1. Material and mftthnHs 
Abbreviations 

RA, retinoic acid; at-RA, a\\-trans RA; 9c-RA, 9-cis RA; RAR, Retinoic acid 
receptor; RXR, retinoid X receptor; RXR-RE, RXR response element; TTNPB, 
ethyl-p-[(E)-2-(5,6,7,8-tetrahydr6-5,5,8,8-tetramethyl-2-naphtyl-1- 
propenyl]benzoic acid; PPAR, peroxisome proliferator-activated receptor; 
DMSO, dimethyl sulfoxide. 

Materials 

at-RA was purchased from Sigma (St. Louis, Mo.) 9c-RA, TTNPB (Mangelsdorf 
et al. , 1 990) was obtained from ligand pharmaceuticals. 
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Cell culture and treatments 

FAO cells are a well-differentiated subclone derived from the rat' 
hepatoma ' H4 IIEC3 line. They ' were cultured at 37°C in a humidified 
atmosphere of 5%C02, 95% air in Ham F-12 medium (Gibco-BRL) containing 
10% fetal calf serum, penicillin '(200 lU/ml) and streptomycine (50 mg/ml) 
according to previously published, procedures (Meunier-Dumort et al. t 1996). 
Culture medium was changed every 48 hr; Experiments were performed on 
subconfluent cells maintained in culture under serum free conditions. at-RA and 
9c-RA* were dissolved in dimethyl sulfoxide (DMSO) while oleic acid was 
complexed with bovine serum albumine before addition to the cells (BSA). The 
human hepatoma cell line HepG2 was obtained from ECACC (Porton Down, 
Salisbury, United Kingdom) and the mouse preadipocyte cell line 3T3-L1 from 
(ATCC). These cells were maintained in Dulbecco's modified Eagle's minimal 
essential mediunri (DMEM), supplemented with 10% fetal calf serum (LPDS- 
DCC), L-glutamine and antibiotics, unless stated otherwise. 3T3-L1 cells were 
differentiated initially by a 2 day-treatment with dexamethasone (0.1 mM), 
isobutyl methyl xanthine (0.25mM) and insulin (0.4 pM). Subsequently, the cells 
were then maintained for an additional 8 days with insulin until complete 
differentiation. Experiments were performed on subconfluent cells maintained in 
culture under serum free conditions. at-RA and 9c-RA were dissolved in 
dimethyl sulfoxide (DMSO) while oleic acid was complexed with bovine serum 
albumine (BSA) before addition totte cells. / \ 

RNA analysis 

RNA preparation, northern blot hybridizations, and quantification of total 
cellular RNA were performed as described previously (Auwerx et al., 1988). A 
mouse FATP cDNA probe was obtained, after cloning an RT-PCR fragment 
from mouse adipose tissue RNA (primers ATG CGG GCT CCT GGA GCA GGA 
CAG CC and CTG CGT GTC AGG CAG GAT GCT CTC. AGG CCC) into 
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pBluescript-KS. The insert was sequenced and found to be identical to the 
reported mouse FATP sequence. The rat ACS probe corresponds to the 
EcoRV restriction fragment of the rat ACS cDNA and an EcoRI fragment of the 
hLPL26 clone was used to measure LPLmRNA. p-actin was used as control 
probe. * 

Isolation of nuclei and transcriptional rate assay 

Nuclei were prepared from FAO and 3T3-L1 differentiated cells and 
treated either with 9c-RA or vehicle. Transcription run-on assays were 
performed as described by Nevins (Nevins, 1987). Equivalent counts of nuclear 
RNA labeled with [a- 32 P]UTR (3000 Ci/mmol) were hybridized for 36 hr at 65°C 
to 5 pg of FATP, ACS, GAPDH and vector DNA (pBluescript) immobilized on 
Hybond-C Extra filters (Amersham). After hybridization, filters were washed at 
room temperature for 10 min in O.SxSSC and- 0,1% SDS and twjce at 65°C for 
30 min and subsequently exposed to X-ray film (BIOMAX-MS, Kodak). 
Quantitative analysis was performed by scanning densitometry (BioRad GS670 
densitometer). , \ 

Preparation of albumin-bound fatty acids and Fatty acid uptake assay 

Radiolabeled 14 [C] oleate fatty acid. was added to water at 40°C. Albumin 
(BSA; fraction V, fatty acid free, Sigma; St Louis, Mo) was then added from a 
concentrated stock (20 g/100 ml) to give a final molar ratio of 1/1 by gentle 
mixing. 2 X Hank's solution was added to obtain a 1 X final solution. Incubation 
was carried out at 37°C for 45 minutes. ' - . 

The measurement of uptake of 1 1 *[C]-labeled oleate (about 50 
mCi/mmol, NEN, Boston, MA) was carried out in 24 or 6 well plates with 10 6 
cells / ml of medium. Before treatment, the cells were washed with 1X Hank's 
solution. BRL 49653 (100 - 250 nM) and fenofibric acid (100 - 250 uM) and the 
other retinoids (dose range from 10' 9 M to 10" 6 M) were added in fresh DM EM 
medium containing 10% FCS ; After 48 h of treatment, cells were washed with 
Hank's solution and incubated for one additional hour in serum-free, glucose- 
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free medium. Cells were then washed once at 37°C and twice at 23 °C with 1X 
Hanks solution containing BSA. Hank's solution without BSA was then added 
before the assay. A volume corresponding to IpCi of 14 [C]-oleate albumin- 
bound solution was added in each well and cells were .incubated for 1 min at 
room temperature. Incubation was stopped after. 1 min with 3 washes of ice 
cold 1X Hank's solution without BSA. A complementary experiment has been 
performed to verify whether a specific cell surface binding of 14 [CJ-oleate could 
interfere with the assay. For this second assay, the cells were washed under 
more stringent conditions in 1X Hank's solution containing 0.5% BSA. Cells 
were then lysed in 400 pi of 0.1% SDS solution. The lysate was counted for 5 
min with 4 ml of scintillation solution. Assay was performed on triplicate points. 

Results 

1.9C-RA induces FATP and ACS mRNA levels in the hepatoma cells FAQ and 
HS PQ 2 

In order to determine whether FATP and ACS genes were regulated at 
transcriptional level by retinoic acid, a dose-response experiment was 
performed. FAO cells were cultured for 6 hr with increasing doses of 9c-RA 
(10 " 8 to 10" 4 M) whereas control cells were incubated with vehicle alone (0,1% 
v/v DMSO). Incubation with 9c-RA resulted in a dose-dependent increase of 
both FATP and ACS mRNA levels in this hepatic cell line (figure 10A). A 
maximal increase for FATP (>9-fold) and ACS (14-fold). was observed with a 
dose of 10' 6 M 9c RA. FATP and ACS m RNA are also upregulated by at-RA but 
to a lesser extent that 9c-RA (data not shown). 

Induction of FATP and ACS mRNA levels in the human hepatoma cell 
line HepG2 was analyzed. 9c-RA induced FATP and ACS mRNA after 24 hr 
(dose 10" 6 M) (figure 10B). However, no effect of at-RA could be detected (data 
not shown). 
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2. 9C-RA induces F ATP and ACS mRNA levels in di fferentiated Caen? and 
3T3 - L1 c e ll s. 

Since adipose tissue is an important insulin-sensitive tissue, it was 
interesting to know whether retinoids also regulate FATP and ACS gene 
expression in mouse adipose cell line, 3T3-L1. 3T3-L1 cells were therefore 
completely differentiated and then treated with 9c-RA (10" 9 to 10* 6 M). 9c-RA 
induces FATP and ACS mRNA expression in a dose-dependent manner. 
Maximum increase was observed at a concentration of 10" 6 M 9c-RA (figure 
11 A). 

The expression of FATP and ACS induction was next compared between 
Fao hepatoma cells and the colon adenocarcinoma cells Caco2. Both FATP 
and ACS gene expressions are increased after 6 hr of treatment with 9c-RA at 
lO^M. However, higher levels of induction are nevertheless observed in FAO 
cells (figure 11B). 

3. The induction of FATP and ACS oene expression bv 9c-RA is at the 
transcriptional level 

Nuclear run-on analysis was next carried out to determine whether the ' 
induction of this expression by 9c-RA was a direct consequence of enhanced 
gene transcription. FAO cells were therefore treated for 2 hr with 9c-RA (10" 6 M) 
after 18 hr of culture in serum-deprived medium, whereas 3T3-L1 cells were 
treated for 24 hr with 9c-RA (10* s M) in normal medium without serum 
deprivation. FAO metabolism is greatly affected with lipids, serum deprived 
conditions allow to observe a strong effect which couldn't be detected in normal 
conditions. Controls were treated with vehicle only. Transcription rates for the 
FATP and ACS genes were induced 2-fold and 2.8-fold respectively in FAO 
cells and 4- and 3-fold respectively in differentiated 3T3-L1 cells. In the retinoic 
acid treated cells, transcription of the GAPDH gene was not affected in neither 
of the two cell lines. 
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4. Rexinoids. but not RAR agonists, induce FATP. A CS and LPL gene 
expression in differentiated 3T3-L1 cells. 

To determine whether the effect' of retinoids on the regulation of FATP 
and ACS gene expression was mediated by RAR or RXR, 3T3-L1 differentiated 
cells were treated with 9c-RA, a panagonist of both RAR or RXR, or at-RA and 
TTNPB which are specific agonists of RAR. No activation of LPL, FATP and 
ACS gene expression was detectable after 24 hr of treatment with at-RA or 
TTNPB. In contrast, a strong dose-dependent induction of both LPL, FATP and 
ACS was observed 24 hr after treatment with 9c-RA, a RXR agonist. Maximum 
induction occurred at 10* 6 M 9c-RA (4-fold and 3-fold for LPL, FATP and ACS 
respectively). Interestingly, no effect of non of the retinoids was observed on 
non-differentiated 3T3-L1 preadipocyte cells (figures 13,. 14 and 15). 

RXR agonists, but not RAR agonists, induce oleate uptake in 
differentiated 3T3-L1 cells. 

To establish whether the induction of the FATP-1 and ACS was accompanied 
by' a concommitant increase jn fatty acid uptake into the cells, we measured 
14 [C] oleate uptake in differentiated' 3T3-L1 cells exposed to increasing doses of 
prototypic retinoid, i.e., TTNPB, an RAR agonist (dose range 10" 9 M to 10" 6 M). 
No effect on 14 [C] oleate uptake was observed after TTNPB confirming that this . 
effect was specific for the rexinoids, (figure 16). 

Discussion • 

These results demonstrate that RXR ligands control both FATP and ACS 
gene expression in several cell types. In the liver, FATP and ACS have been 
shown to be strongly upregulated by PPARa activators (Schoonjans et al., 
1995; Martin et al., 1997). These distinct transcriptional effects are mediated by 
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PPARa interacting with ' responsive elements in the promoter regions of these 
genes. As PPARa effects are mediated through activation of the PPARa-RXR 
heterodimer, it was not surprising to observe a strong induction of FATP and 
ACS expression by specific RXR activators. Expression of FATP and ACS are 
co-induced by 9c-RA in the liver and in intestine. Free fatty acids generated by 
LPb are avidly taken up by these tissues and are converted in metabolic active 
acyl-CoA derivatives to sustain a high level of p-oxydation. the co-induction of 
FATP and ACS by 9c-RA, which resembles the induction of these genes by 
fibrates, indicates that 9c-RA and fibrates have a similar metabolic effect on rat 
and human hepatoma cell line (Kliewer et aL). These data, demonstrate that the 
RA pathway is implicated in the regulation of lipid metabolism and underline the 
powerful action of specific RXR agonists on the uptake and metabolism of long- 
chain fatty acids in the liver. 

In adipose tissue, a second insulin-sensitive tissue, FATP and ACS are 
also co-regulated by 9c-RA in a dose-dependent manner. PPARy is the 
predominant in this tissue and PPARy activators have been shown to induce 
both FATP and ACS expression. Hence, the induction of FATP and ACS 
expression by RXR ! agonists in the context of PPARy-RXR heterodimer 
suggests that RXR is also here an active heterodimer partner. 

The stimulatory effects of 9c-RA on FATP and ACS gene expression in 
both tissue implicate the RA pathway as an important regulation pathway in 
liver and adipose tissue where retinoids have a significant physiological role in 
fatty acid metabolism: Since FATP and ACS genes are implicated in fatty acid 
partitioning, it can be concluded that the distribution of fatty acids is affected in 
response to 9c-RA. This effect of rexinoids on fatty acid partitioning is different 
from the effects observed with PPAR activators and depends on the relative 
expression of the different RXRs and PPARx (Braissant et al., 1994; Lemberger 
et al. 1996; Auboeuf et al., 1997). RXR is present in both liver and adipose 
tissue and this pattern of expression determines the tissue specific effects of 
RA activation. In contrast, activity of PPARa agonists is mostly limited to the 
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liver, the prime site of expression of PPARa. Furthermore, the fact that the co- 
regulation of LPL, FATP and ACS genes by 9c-RA, TTNPB and at-RA depend 
on the differentiated state of 3T3-L1 cells points to the importance of PPArg in 
this process. In preadipocyte cells PPARy is expressed at low levels and its 
expression increases upon adipocyte differentiation. The absence of a retinoic 
acid response in differentiated 3T3-L1 cells expressing almost no PPARy 
suggest that PPARy is an obligatory partner of RXR in the retinoid-dependent 
regulation of LPL, FATP and ACS genes in adipose tissue. Hence, due to its 
more general. expression, it is expected- that the effects of rexinoids are more 
generalized than the effects of the more tissue-restricted PPARa (liver) and 
PPARy (adipose tissue) agonists. In fact, rexinoids effects on gene expression 
should resemble the effects of a combined PPARa and y agonist. This effect is 
direct on the FATP gene and suggests the presence of a response element. like 
it has been demonstrated in ACS and LPL gene (Schoonjans et al., 1996) but , 
also and recently in FABP gene (Poirier et al. f 1997). 

It seems that rexinoids, the new term used to specify RXR selective 
ligands collaborate with PPAR ligands to control the expression of FATP and 
ACS genes involved in fatty acid metabolism, the beneficial effect of PPARy 
activators on glucose homeostasis has been previously shown by * us to be in 
part due to a redistribution of fatty acids towards adipose tissue with a relative 
depletion' of fatty acids, in the muscle (Martin et al., 1998). As known since 
Randle's work in the 60 (Randle et al., 1963; Randle et al., 1964), depletion of 
muscle fatty acid content will result in an improvement of glucose homeostasis. ' 
Hence, it is tempting to speculate that the improvements of glucose 
homeostasis observed with rexinoids are similarly linked to an altered 
partitioning of fatty acids. In view of the distinct tissue distribution of RXR 
expression, induction of FATP and ACS by RXR ligands might not have the 
same tissue-specificity as .that of PPARa and, PPARy activators and could 
contribute to a different tissue partitioning of fatty acids. These agents provide 
hence an alternative way to regulate the expression of genes implicated in fatty. 
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acid distribution among the different tissues. * 

This study demonstrates that RXR ligands regulate the expression of 
LPL, FATP and ACS genes in several cell types. The retinoic acid pathway is 
an important signalling pathway for the regulation of genes which are implicated 
in fatty acid supply. Our results suggest that the PPAR-RXR complex is the 
; molecular target by which rexinoids regulate FATP and ACS gene expression. 
Furthermore, they define FATP and ACS as new target genes in the RXR- 
dependent signalling pathway. The observation that both rexinoids and 
peroxisome proliferators stimulate the transcription of these genes implicated in 
lipid metabolism, suggest that rexinoids may act as an hypolipidemic and 
hypoglycemic agent through activation of PPAR-RXR complex and are 
consistent with the hypothesis that the heterodimer PPAR-RXR heterodimer is 
the molecular target for the improvement of insulin sensitivity. 
TTNPB, which is an activator of RAR in the context of RXR-RAR heterodimer, 
has no effect on FATP-1 arid ACS. The combined results of the studies using 
these synthetic retinoids implicates the PPAR-RXR heterodimer as the 
molecular target of the regulation of FATP-1 and ACS by retinoic acids. Since 
PPAR and RXR are both active components of this heterodimer and since we 
have previously demonstrated that the FATP-1 and ACS genes are both 
regulated by PPAR activator (Martin, G. et al (1997) J. Biol. Chem. 272:28210- 
7), it appears that rexinoids could cooperate synergistically with PPAR ligands 
in the control of the expression of these two genes involved in fatty acid 
metabolism. Furthermore, the fact that the coregulation of FATP-1 and ACS 
genes by rexinoids depends on the differentiated state of 3T3-L1 cells points to 
the importance of PPARy in this process. In preadipocyte cells PPARy is 
expressed at low levels and its expression increases upon adipocyte 
differentiation. The absence of a rexinoid response is undifferentiated 2T3-L1 
cells, expressing almost no PPARy, hence also suggest that PPARy is an 
oligatory partner of RXR in the retinoid-dependent regulation of FATP-1 and 
ACS genes in adipose tissue. ,r 
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Example 3 

Expression of FATP in human tissues 

In order to determine whether the expression of FATP was ubiquitous or limited 
to some tissues, hybridization experiments were performed with a human probe 
radioactively labeled on a commercial Northern Blot (Clontech) containing 
mRNA of 8 human tissues. These tissues are heart, brain, placenta, lung* liver, , 
skeletal muscle, kidney, and pancreas tissues (figure 14). The expression is 
very high in the skeletal muscle, high in heart and pancreas, medium in brain, 
weak in placenta, liver, and very- weak in kidneys. The human mRNA 
corresponding to FATP has a size of 4.4 kb. The human p-actin probe has 
been used for the normalization and the top signal represents the ubiquitously 
expressed isoform, whereas skeletal and cardiac muscle express a specific p- 
actin isoform. 

The commercial Northern Blot did not contain adipose tissue probably due to 
difficulties in performing its extraction. Therefore, a supplementary Northern 
Blot analysis was performed* on various human cell lines and adipose tissue. 
The expression of FATP was analyzed. FATP was very highly expressed in 
adipose tissue. To a weaker extent, FATP was expressed in hepatocyte and 
monocytic cell lines. The expression was detectable in chorionic carcinoma 
cells (Jeg3) and human colon cells (Caco2). 

The high level of FATP mRNA in the skeletal muscle has been checked on 
several skeletal muscular tissues and the level of expression of the FATP 
mRNA in this tissue has been compared with the level in sub-cutaneous or 
visceral adipose tissues, liver and colon tissues (figure 17). 

Interestingly, the expression level in the human muscle (lines 1, 2, and 3) has 
been confirmed at a level which is close to the level observed in adipose tissue, 
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which was not the case when the same experiments were performed on 
rodents. The expression of FATP is very high in the adipose tissue. In normal 
patients, the adipose tissue is sub-cutaneous. In patients suffering from obesity, 
the adipose tissue is visceral and sub-cutaneous. In these patients, the 
5 expression of FATP is higher at the level of the sub-cutaneous tissue. 

FATP is thus expressed ubiquitously in human tissues, with a predominant 
expression in adipose tissue, muscle and heart. 

10 As a transport protein, FATP would permit transportation of fatty acids in the 
adipose tissue in nutrition periods and would be involved in efflux of said fatty 
acids in fast periods. Furthermore based on the observation that FATP* is highly 
expressed in skeletal muscle, it is concluded that FATF> is very important as 
r energy source for a tissue which is known to use mainly glucose in normal 

is conditions. This result emphasizes a competition between glucose and long- 
chain fatty acids for the muscular activity. 

The third tissue which expressed a high level of FATP is heart. This expression 
is not influenced by nutrition. In brain tissue the level of FATP is significant. In 
20 addition to its activity as energy source, FATP could participate in the 
incorporation of lipids for synthesis of axial membranes and myelin. In the 
pancreas, FATP could be the specific transport protein of long-chain fatty acids 
messengers essential for the regulation of the production of insulin by the 
pancreas. 

25 

It is especially noted that the level of FATP is especially high in tissues which 
are sensitive to insulin such as adipose tissues and muscle. 

Western bl t analysis fFATP. 

30 Cells and tissues were homogenized in a lysis buffer of PBS containing 1% 
Triton X-100 (Sigma, St Louis, MO). Tissues were homogenized with a polytron 
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in extraction buffer containing PBS and 1 % NP-40, 0,5 % sodium deoxycholate, 
0,1% SDS (Sigma, St Louis, MO). In the presence of a fresh cocktail of 
protease' inhibitors (ICN, France) (100 Mg/ml AEBSF, 5 mg/ml EDTA, 1pg/ml 
leupeptin, 1 pg/ml pepstatin). Protein extracts were obtained by centrifugation 
of the lysate at 4°C and then concentration was measured with the Biorad DC 
Protein colorimetric assay system (Biorad, France). 

Protein (100 pg) were separated on a 10% polyacrylamide gel according to 
Laemmli [Laemmli, 1970 #1541], transferred to nitrocellulose membrane as 
described by Towbin [Towbin, 1979#1542] (Amersham, France), and blocked 
overnight in blocking buffer (20 mM Tris, 100 mM NaCI, 1% Tween-20, 10% 
skm milk). Filters were first incubated 4 hours at 21 °C with rabbit IgG anti- 
mFATP (10 mg/ml) developed against a FATP peptide, corresponding to amio 
acids from human FATP-1, and next for 1 hour at 21 °C with a goat anti-rabbit 
IgG (whole molecule) peroxidase conjugate (Pasteur Diagnostic Sanofi, 
France) diluted at 1/5000. The complex was visualized by chemiluminescence 
using with 4-chloro-1-naphtol as reagent according to the manufacturer's 
protocol (ECL, Amersham, France). 

Northern blot analysis of different human cell lines, such as NCL-H295, THP-1, 
Caco2, chorionic cells and hepatoma cells shows that hFATP-1 can be 
detected in all these cells but to a relatively lower extent than in human adipose 
tissue (figure 18). hFATPI mRNA is relatively will expressed in Caco2 cells and 
in cells of hepatic origin. 
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CLAIMS 

1. Nucleotide sequence which comprises a sequence involved in the 
expression of the human Fatty Acid Transport Protein (hFATP) comprising the 
aminoacid sequence of Figure 2 or Figure 5. 

2. Nucleotide sequence according to claim 1, which comprises a 
sequence encoding the human FATP, corresponding to or comprising the 
nucleotide sequence of Figure 1 or Figure 3. 

3. Nucleotide sequence according to Claim 2, which is the nucleotide 
sequence of Figure 1 or Figure 3. 

4. Nucleotide sequence according to Claim 2, which comprises an ORF 
sequence located, between nucleotide 23 (ATG) and nucleotide 1963 (TGA) of 
Figure 1 or Figure 3. 

5. Nucleotide sequence hybridizing in high stringency conditions, with a 
probe comprising 50 to 300 nucleotides including at least 6 t preferably at least 
9 nucleotides from the following sequence : 
CGGGGAGACGGGACGTGAAGGG, 

6. Nucleotide sequence according to anyone of claims 1 to 5 which is a 
genomic DNA. 

7. Nucleotide sequence according to anyone of Claims 1 to 6, which is 
the gene coding for the hFATP. 

8. Nucleotide sequence according to anyone of Claims 1 to 3 or 6 to 7, 
which comprises a transcription initiation site 61 bp upstream from the ATG 
codon. 

9. Nucleotide sequence according to anyone of Claims 6 to 8, which 
contains 12 exons and 1 1 introns having the following structure: 
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167 562 724 886 997 1206 1333 1471 1636 1783 
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10. Nucleotide siequence according to claim 9, which comprises the 
nucleotide sequence of Figure 4. 

11. Nucleotide sequence according to anyone of claims 1 to 10, which 
comprises sequences involved in the regulation of the expression of the gene 
coding for the human FATP. . 

12. Nucleotide sequence according to anyone of Claims 1 to 11 , which 
sequence either specifically hybridizes with one or both of the primers having 
one of the following sequences or is the amplification product obtained with the 

following sequences, or hybridizes in high stringency conditions with said 

«, 

amplification product : . , 

AAGGTCAATG AGG ACACAATGG , 
CGAGTAGGTAGTGATCGTGCAG. 

'13. Nucleotide sequence according to anyone of claims 1 to' 12, which is 
a genomic sequence coding for FATP and which hybridizes in high stringency 
conditions with a probe containing around 200 bp and being preferably derived 
from the sequence of exon 2 f said genomic* nucleotide sequence comprising a 
sequence involved in the regulation of the gene coding for the FATP. 

14. Nucleotide sequence which is a mRNA as obtained by transcription * 
of a genomic nucleotide sequence coding for the hFATP according to anyone 
of Claims 6 to 13. 

15. Nucleotide sequence according to anyone of Claims 1 to 13 which is 
the RNA sequence transcribed from the nucleotide sequence of figure 1 or 
figure 3 having 2 kb, or an RNA sequence including a nucleotide sequence of 
4,4 kb hybridizing in high stringency conditions with the sequence of figure 1 or 
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figure 3. 

1 6. cDNA as obtained by reverse transcription of the mRNA according to 
Claim 14 or 15. * 

17. Polynucleotide which is selected among the following sequences: 
5 - AAGGTCAATGAGGACACAATGG 

- CGAGTAGGTAGTGATCGTGCAG - . 

- a sequence comprising or corresponding, to sequences involved in the 
regulation of the expression of the gene encoding the hFATP, 

- any fragment derived from the nucleotide sequences disclosed in 
10 figures 1, 3 or 4, for instance by deletion mutation or insertion provided 

the essential biological properties of the native sequences* are 
maintained. : ; . 

18. Vector comprising, inserted in a site not essential for its replication, a 
nucleotide sequence according to anyone of claims 1 to 16. 

is 19. Vector according to Claim 18; which further comprises a reporter 

sequence such as the CAT gene or the luciferase gene. 

20. Vector according to Claim 19, wherein the expression of the reporter 
gene is under the control of an heterologous regulator region. 

21. Recombinant cell which comprises a nucleotide sequence according 
20 to anyone of claims 1 to 16 or a vector according to anyone of Claims 18 to 20. 

22. Recombinant cell according to Claim 21 which is a eucaryotic cell. 

23. Human Fatty Acid Transport Protein (hFATP) comprising the 
aminoacid sequence of figure 2 or figure 5. 

24. Human FATP according to Claim 23, which has an isoelectric point of 
25 8.5 and which replies to the aminoacid sequence of Figure 2 or Figure 5. 

25. Human FATP which is encoded by a nucleotide sequence according 
to anyone of Claims 1, to 16. A 

26. Human FATP which is devoid from its signal peptide. 

27.. Human FATP according to anyone of claims. 23 to 26, which is 
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glycosylated. h - ; 

28. Antibodies specifically directed against. hFATP according to anyone 
of Claims 23 to 27. \ \ 

29. Composition for the therapeutic, modulation of the intracellular level of 
5 " long-chain fatty acids, which comprises a component capable of regulating the 

expression of the gene encoding hFATP. - V * ' V 

30. Composition for the therapeutic modulation of the blood level of long- 
chain fatty acids, which .comprises ' a component capable of regulating the 
expression of the gene encoding hFATP. 

10 31. Composition for the treatment of a pathological state associated with 

a deficient regulation of the intracellular level of long-chain fatty acids, which 
comprises a component capable of regulating the expression of the , gene 
encoding hFATP. - > 

32. Composition according .to claim 30 or 31, which comprises an 
15 inhibitor of the expression of the hFATP gene. r, 

. ' 33. Composition according to claim 30 or 31, which comprises a 
compound capable of enhancing the expression of the hFATP gene. 

34. Composition according to claim 33 for the treatment of 
cardiomyopathies or diabetes. <- ,.\'s 

20 35. Composition according to claim 32 for the treatment of obesity. 

36. Method for the screening of the effect of a determined compound, on 
the expression of the hFATP protein in determined cells, which method 
comprises the step of detecting the transcription of the mRNA in the cells or cell 
extracts, after contacting the* cells with said determined compound, , in 

25 conditions enabling the interaction between said compound and said cells. 

37. Method according to claim 36, for the screening of the effect of a 
determined compound on the regulation of the expression of the hFATP protein 
in cells, which comprises: ' * 

a) measuring the level of transcription of the mRNA in cells or cell 
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extracts, wherein the cells have previously been contacted with the determined 
compound, in conditions enabling the interaction of said cells and said 
determined compound; 

b) measuring the level of transcription -of the mRNA in the same cells 
5 species as in step a) or on extracts of these cells, wherein these cells have not 

been previously contacted with the assayed compound; 

c) comparing the level of transcription obtained in steps a) and b). 

38. Method according to claim 36 or 37, wherein the cells used are 
selected among the group of liver cells, heart cells, adipose tissue cells and 
io skeletal muscle cells. 
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FIG. 1 A 

l 

CGGGGAGACGGGACGTGAAGGGATGCGGGGTCCGGGTGCG 

ATGCGoGCTCCGGGTGCG 

1 • 

41 

GGCGCGGCC7CGG7GG7C7CGC7GGCGC7G77G7GGC7GC 
GGCGCGGCCTCGGTGGTCTCGCTGGCGC7GTTGTGGCTGC . 

,4r t . . : ,. \ : .. . . 

81 

tggggctgccgtggacctggagggcggcagcggcgctcgg; 
tggggctgccgtggaccrrggagcgcggcagcggcgctcgg 

81 
121 

CGTGTACGTGGGCAGCGGCGGCTGGCGCTTCCTGCGCATC 
CG7G7ACG7GGGCAGCGGCGGC7GGCGC7TCC7GCGCA7C 
121 

161 

GTCTGCAAGACCGCGAGGCGAGACCTCTTCGGTCTCTCTG 

GTCTGCAAGACCGCGAGGCGAGACCTCTT - 

161 

201 

TGCTGATCCGCGTGCGCCTGGAGCTGCGGCGGCACCAGCG 

189 
241 

TGCCGGCCACACCATCCCGCGCATCTTTCAGGCGGTAGTG 

— - - GCGGTAGTG 

189 

281 " , 

CAGCGACAGCCCGAGCGCCTGGCGCTGGTGGATGCCGGGA 
CAGCGACAGCCCGAGCGCCTGGCGCTGGTGGATGCCGGGA 
199 

321 

CCGGCGAGTGCTGGACCTTTGCGCAGCTGGACGCCTACTC 
CCGGCGAGTGCTGQACCTTTGCGCAGCTGGACGCCTACTC 
239 

361 

CAA7GCGG7AGCCAACC7C77CCGCCAGC7GGGC77CGCG 

CAA7GCGG7AGCCAACC7C77CCGCCAGC7GGGC77CGCG 

279 

401 

319 
441 

AGTTZGT3GGGGTGTGGCT3GGCCTGGCCAAGGCGGGCAT 

Aui — — — . ; juu> — - .---i-^w 

359 • * 
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FIG. 1B 



481 

GGAGGCCGCGCTGCTCAACGTGAACCTGCGGCGCGAGCCC 
GGAGGCCGCGCTGCTCAACGTGAACCTGCGGCGCGAGCCC 
399. 

521 

CTGGCCTTCTGCCTGGGCACCTCGGGCGCTAAGGCCCTGA 
CTGGCCTTCTGCCTGGGCACCTCGGGCGCTAAGGCCCTGA 
439 

561 

TCTTTGGAGGAGAAATGGTGGCGGCGGTGGCCGAAGTGAG 
TCTTTGGAGGAGAAATGGTGGCGGCGGTGGCCGAAGTGAG 
479 



hFATPl tUTR 
hFATP' ^UTR 



hFATPl +UTR 
hFATP ' +UTR 



hFATPl +UTR 
hFATP' +UTR 



601 

CGGGCATCTGGGGAAAAGTTTGATCAAGTTCTGCTCTGGA ' hFATPl +LJTR 
641 

GACTTGGGGCCCGAGGGCATCTTGCCGGACACCCACCTCC hFATPl +UTR 
681 

TGGACCCGCTGCTGAAGGAGGCCTCTACTGCCCCCTTGGC hFATPl +UTR 
721 

ACAGATCCCCAGCAAGGGCATGGACGATCGTCTTTTCTAC hFATPl +UTR 
761 

ATCTACACGTCGGGGACCACCGGGCTGCCCAAGGCTGCCA hFATPl +UTR 
801 

TTGTCGTGCACAGCAGGTACTACCGCATGGCAGCCTTCGG hFATPl rUTR 
841 

CCACCACGCCTACCGCATGCAGGCGGCTGACGTGCTCTAT hFATPl +UTR 
881 

GAGTGCCTGCCCCTGTACCACTCGGCAGGAAACATCATCG hFATPl +UTR 



921 

GCGTGGGGCAGTGT' 



:tatgggctgacagtcgtcct h FAT PI +UTR 



961 



1001 

atcaagtacaac: 



•GT3GTTCAGTACAT: 



hFATPl -rUTR 
hFATPl -UTR 



hFATPl 1 -UTR 
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FIG. 1C 

1081 

GAGGCSACACC3CGTGCGCCTGGCGGTGGGGAACGGGCTG 

^vj av -*utn iL. sjuu/Ujvj/\b i i ^n^curtu^uu I TCGGCG 

1161 ' 

f ACGC C AAATCC3GGGAGTTCTACGGCGCC ACCGAGTGCAA 
1201 



hFATPl - -UTR 
hFATPl -UTR 
hFATPl -UTR 



:TGCAGCATTGCCAACATGGACGGCAAGGTGGGCTCCTGT hFATPl -UTR 



1241 



GGtTTCAACAGCCGCATCCTGCCCCACGTGTACCCCATCC hFATPl -UTR 



1281 



GGCTGGTGAAGGTCAATGAGGAGACAATGGAGCTGCTGCG hFATPl -UTR 



1321 



GGATGCCCAGGGCCTCTGCATCCCCTGCCAGGCCGGGGAG hFATPl tUTR 



1361 



GGTGGCCTCCTTGTGGGTCAGATCAACCAACAGGACCCGC 



1401 



TGCGCCGCTT.CGATGGCTATGTCAGCGAGAGCGCCACCAG 



1441 



CAAGAAGATCGCCCACAGCGTCTTCAGCAAGGGGGACAGC 



L481 



1521 



1561 



1601 



C-TGG7GAGCCGGGTGCTGGGCCAGACAGAGGTGGCCGTCT 
1641 

ATGGGGTGGCTGTTCCAGGAGTGGAGGGTAAGGCAGGGAT 



hFATPl +UTR 
hFATPl ^ +UTR 
hFATFl -UTR 



GCCTACCTCTCAGGTGACGTGGTAGTGATGGATGAGCTGG hFATPl -UTR 



GCTAGATGTACTTCCGGGACCGTAGCGGGGACACCTTCCG hFATPl -UTR 



:TGGCGAGGGGAGAACGTCTCCAACAGCGAGGTGGAGGGC hFATPl -UTR 



hFATPl -UTR 
. hFATPl -UTR 



1531 

3GC3GGC3TCGGAGAGCCCCACAGCG7GG7GGAGCCCAAC hFATPl -UTR 
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FIG. 1D 

1761 

CCCGGGCCATCT7CCTGCGCGTCCTGCCCCAGG7GGACAC hFATPl -rUTR 
1801 

GACAGGCACCTTGAAGATCCAGAAGACGAGGCTGCAGCGA ' hFATPl +UTR 



1841 

GAGGGCTTTGACCCACGCCAGACCTCAGACCGGCTCTTCT hFATPl +UTR 
1881 

TCCTGGACCTGAAGCAGGGCCACTACCTGCCCTTAAATGA hFATPl +UTR 
1921 

GGCAGTCTACACTCGCATCTGCTCGGGCGCCTTCGCCCTC ' hFATPl +UTR 
1961 

TGAAGCTGTTCGTCTACTGGCCACAAACTCTGGGCGTGGT hFATPl +UTR 
2001 

GGGAGAGGCCAGCTTGAGCCAGACAGCGCTGCCCAGGGGT . hFATPl +UTR 
2041 

GGCCGCCTAGTACACACCCACCTGGCCGAGCTGTACCTGG hFATPl +UTR 
2081 

CACGGCCCATCCTGGACTGAGAAACTGGAACCTCAGAGGA hFATPl +UTR 
2121 

ACCCGTGCCTCTCTGCTGCCTTGGTGCCCCTGTGTCTGCC hFATPl -UTR 



2161 

TCCTCTCCCTGCTTTTCAGCCTNTGTCTGCTTCCATCCNT hFATPl +UTR 



2201 

GTCCCTGTNTGGCCTTAACCCG 



hFATPl rUTR 
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AIG 033 GOT OOG G3T OX OX GXiGGC X3 GX TO3 CX OX CX TX IX v3G CE hFATPl 
M R A P G A G T A S V A S L A L L W F L 
AIG OX GdL GOT CIA GA CCA AAXC 1^ GIG GXICA CIG fULCSj CTTTGG CIS rnFATP 
M R MM P G A, G T A S V A S L [gJ L L W M L _ 
AIG QXSTOOS 03AGCA G^ AA XC I^.GX CXCICA ITO t3? CX CTT IX felT CX r FATP 

61 G L P W T W S A A A . A GH Ci] V Y V G S G G 

G3G CIG 033 IG3 2CC 103 AX G33 GCA 033 G33 CICGOC GCG IPC G3G G33r£C 033 GIT hFATPl 
G L P W T W S AAAAFCVYVGGG G 

CTT 003 IG3 AX TX AX OX GCG 033 033 TIC ami GIG TEC GIG G3TQX 033 GX mFATP 
G L P W T W S A A A A F \GJ V Y V G S G G 
GGACTTOX IGSAX iaS AX CX3 rFATP 

121 W R F L R I V C K T A R R D L F G L S V 

TGG 030 TX CIG 030 AX GICTXAAGAX GX AXCGACAC C3C TXQ3TCX XT GB3 hFATPl 

W.R F L R I.V CK T A R R D L F G L S V 
TG3 033 TIT CIG Q3T AX GTC TX AAG A33 033 A33 OGA (AC CE TD 1 G33 CIC XT GET mFATP 
WR FLR IV CK TARRDL F GLSV 
103 OA T3T CIG G3T AX GTC TXMG AG OX AX 031 (3C CEO TIT 033 CIO XT GET rFATP 

181 L I RV R LE L R R H [q] R A G S T 1 p OD 

CIG AIC 000 GIG 030 CT3 G¥3CXA3GOX CtC OGCET GOO GOO AOCAX 003 UC hFATPl 

L I RVR L £ ' L R R H R RA G D T I P C 
CIG AIT OGT GTT 033 CIA ®G CE CGAOA C7C 0330GA (IA OA GEO MS AX OX ;KL mFATP 

L I R V R L E L R R H R R A G D T I P |R| 
CIG AIC 030 GIG 033 CIA G¥3 CIA 03A 031 C7C 003 OGA GOV OGA CSC AOS AX OGA uT rFATP 

241 I F Q A V [vJE * QP ERLALVD A |g|| tI 

AIC *nT C7G 003 GZA GIG (AG OA CAG GX G?G 000 CIG 033 CIG GX GKTQX 033 AC hFATPl 

I F Q A V A R R Q P E R L A L V D A S S c a td 
AIC 1IC C3G GOT GIG OX 033. OGA CAA GA GZG 030 CIG CIA CIG GX GftOGX AST AST HlrAlr 

I F QAVa[qJRQ P E R LA L VDAS S catd 
AIC TIC C?G 000 GIG OX tTCOA CAG OX SG OX CIG OX CX GIA GC033 AJT AT rrA1 * 

301 gFeIc w t f a ql d(a]y S N A V A n'l F 

GXCHJlGCIGSAXTIT 033 OG CTG GC QXIECTOC MT GCG GIA OX APC CDO TX hFATPl 
GI CWTFA QL DTYSNAVANL F 

GOT AIA 103 103 AX IX 0CAC3GCFGGC AT I3£ TOO ffiT GCT GIA GX AAT CIG TE mFATP 
G I C WT F A Q L D T Y S N A V A N L F 

Q3T AIC TX T33 AX IX OCA OG CTA GPC AX-T&G TOO PKT OZT GX 000 AST CTG IX rFATP 

361 R Q L G F A P G D V. V A (71 F L E G - R P E 

030 (A3 CX GDC TIC OX 033 OX CAC G23 GX 000 HC TIC OG GPGQ3CCX GX GTG hFATPl 

RQLGFA PG D VVAVF.LEGRPE 
XXC &G CX G3C TIT OCA OCA GX CAT GE3 GX OCT GIG TIC CE Gffi GOO GX GOG GG mFATP 
|L Q LGF AP GDVV AVF L .EG RP E 
HE CM CX GOO TTT OX CEAGX CAT GX GX OOTGIG TIC CX Gft 030 CX OX GG rFATP 

421 FVGLWL G L A KAG 1 M II E I A A L L N V 

TICGrGOXCIGX3CX 030CXXCAC GX 030 AIG GG GX OX CIG CX A=C GX hFATPl 

FV G LW lgl;a K AGVV A A L L N V 
TTCGXOACIGIDGCXaXCXXCiVGaX GGT GIG GIG CET GOT CTTCX PXT GX mFATP 

FVG LW LG L A K AG V V A AL LN V 
TIC GX CIA CIG T33 CX 030 OX XO £PG GX OGTGIA GIG OCT OX rXT GX rFATP 

481 W L R R E P L AFC L G T S fT| A K A L I 

aac cx ax coo c?g ax oggxttox: cxox/ootx terccrA?GGX cx ax hFATPl 

NL RRE PL A F C LG-TS A AKAL I 
AAC CT3 r£G 03G GW3 OX CTG GX T?C IX CX O30A3A XA GOT GX AA3 GX CX ATT mFATP 

N L . R R E PLAF C LG-TS A AKAL I 
.VCCXAXCG3CA3CXCrT0XT?CX:TX 030 ACA XA OCT OX AA3GX OX AIT rFATP 

541 [7] G G E M I!/] A 'a V A E V S fcTI [TI L G K £ L 

TTTGG^CX^C^AIG&GdX3XG?G3X G^ GTIG AGO GX GST CX G33 A^A PGT OX hFATPl 

V G G E M A A A V A Z V S E Q L G S L ■ 

"3T 330 GX GtG AIG GCA GX OX C?3 3X GrG GIG AGO G^ CrG CX 033 A^G PGC OX mFATP 

V 3 G E M A A A V A E V 3 E C L G K J L 

l^r GX GX GtG AIG GA GX GX GTG 3X G?G GIGA3T C?G CG CX G33 AA3 KSZ CX rFATP 
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60 if?] K FCSGDLG P E [g] I L P D T [5] L L 
aIcMTX TXXTGGl GCTEQ3GGX GrG 32AIC TO GX GrC AXS? OC CE 
LKF'CSGDLG PESILPDTQ LL 

cx»g^: tgc TTcaGflrcxcEGcnr g?g 201331: og cetgscmscg oc ce 

L K F C S GD LG P E S [v] L P DT Q L L 
CIC/rG TX TXXTG^ GnTCX QGG CUT G?G AXGX CX OCT CSC AX OG OT CSS 

661 D P [Ij L 0 E A fsl T HI P L A 0 IT1 P [j] K G M 
©C CX CX CX A£ GG OX TO £CT gE GX TIG G3, G& AX OX SlJ AJG GX AE 

DP MLA EA PT T PLA QA PGKG M 
GfiCGX A3E CTT QCTC5G GXOX JCC3CA CCG CIGGCACAA GX GCAGXAA3 CUT AC 

DP K L : A EA PTT P LA QA P GKG M 

cac goc ax err err csg ens ox ice jca ox cig cx\ og gx cm gx aag or ae 

A A X 
XT GDC ATT 
A A I 
GX ATT 
A A I 
333 GX AIT 




781 V V 
GIC GIG 

V V 
GIG GIG 

V V 
GIG GIG 



S 

C?CAX 
K S 

CC AX 
H S v 

CCAX 



ryyrTmIaafghh 

AX UC CDC ATG GA GX TIC OX OC OC 
R Y, Y R I A A F G H H 

AX TEC •ECGXArraT OXTITGX aC OT 
RYYRIAAF GHH 

AX TPC T?C OX ATC CEA GX ticgx oc CAT 



S y GO m 

GXTacax aig oS 



S Y S M R 
TXTACAX AIG OZT 

S Y S M R 
ICCUC AX AIG QGG 



hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 

rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 



III G 



841A A D V L Y D- C L P L Y H S A G N , I 
GX GCT CSC GIG CIC THT OC TX CTG OX CIG TPCCPC TX GCA Q3VAM:klC 
A A 

arp 

A |N| D: V L Y D C L P L Y H S A G N I M G CXTO 
QX GAT GIG OC IHT CSC TX CTA OCT CSC TRC CftCTEA GJi GX AAC AX AIG OX rrAir 



ab hFATPl 

DV-LYDCLPL. YHSAG NIMG mPATP 
GST GIG dC THT QC TX CTG OCA CE TRCCEC XT GCA.QX AftCAX AIG G3T mrAir 



901V G Q C |lJ I Y G L TVVLR KKFSA S 
GK5GX OG TGTCIC AX TXTGX CTG2CA GXGICCICGX PfG AM.TICTX GX AX 
VG Q CV I Y GL TV V L R K K F S A S 
GIG GX CSG TGC GXA3C IRC GX TTG/OG GTE GIACIG QX #G AA3TICTX GX AT 
V G QCmiYGLT VVLR KKFSA S 
GIG OACG TGTfeCAX IPC GX TTA AE GE, GIACIG CDC WG AA3 TIC TX GX KC 



961 R F 



D D C 



HJ K 

&CAA3 



V 



Y 

OX TIC TX G?C OC TGT SCAA3 TAG 

RFWDDCV KY 
OX T1C TX GnT OC 1GT GIC AA5 TACjOT TX ACCjGEV 

RFWDDCVKY 
OX TX TX C?C GPC TGT GIC AM. TAT 



N C T 
NC TXAOtfGIG 
N C T 

TX 
N C T 
TX 



V 



V 



V Q Y I G E I 
GIT CPG TCA3CGX CM AC 

V Q Y I G E I 
GIG OG T3CA3ACXT OA AX 

V Q Y I G E I 
GIG OG TC A3C G3T GAA AIA 



hFATPl 
mFATP 

rFATP 

hFATPl 
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L201G K V G S C G r N S R I L |Pj H V V ? I R 

GX ArG GE GX TX TGT G3T TX AAC AX OX A2CCIG OX OC GIG UC CX AIC CEG hFATPI 
GKVGSCGFNSRILTHVYPI r 
■ OX &G GUT GX TX ICC OX TIC AAC AX GGT AIC CIC AX CRT GIG ISC CX AIC CUT mFATP 
G K V G S C G *? N S .R I L T H V Y ? I R 
OX rrG GX GX ICC IX GX TX AAC AT XT AIC CIC AX OC GIG TAC CX AX CUT rFATP 

L261L V K V N E D T M E \±\ L R , D {TJCs] G L C I 

CIGGrGArGGICAKTGG GrCAO^AIGGG CECIGOXGKT ffiScjCCr mfflC hFATPI 

L .V K V N E D T "M E P L R D S E G L C I 
CZG GIC AA3 GIC JOT GH3 GAC AX AIG GG OCA CIG 033 GfiC TC £2G GX CX IX AQC mFATP 

L V K V N E D T M E P L , R D S (q) G L C I 
CIG GIC AA3 GIC AftC GSG OC AX ATC GrG COVCIGAGG CSC TX tefQXCX XC AX rFATP 



1321P C Q A G E P G L^L V G Q I N Q Q D P L 

OX J3Z OG GX GQ3 GrG CCT CHT CIC CTT GE G3TOG AIC ABC C?A CPG&C CX CE hFATPI 

PCQPGE PGLLV GQINQQD PL 
aXIXCG CX033C5A OX OX CTT CE GE GX CRG AIC HC CAG GGGPC XT. CIG mFATP 

P C Q P G E P G L L V G Q.I N Q Q D P L 
OOG *SX OG OX 033 C3A OCT GX GTTCE GE GXCAG AIC NC OG CAAGC XT CE rFATP 

1381R R F D G Y. V S [¥] S A T (T) K K I A H S V 

GX OX TIC GAT GX TOT GIC AX C?*G AX GX AQC S2 AM AK3 AX GX CTC AX GIC hFATPI 

R R F D G Y V S D S A T N K K I A H S V 
OX G3T TE GOT GGT TAT GIT AGT GAC AGT OX AX AAC AAG AH3 AIT OX C7C A3C GTT mFATP 

RRFDGYVSDSAT N KK IAHS V 
OX GX TIC GOT GX TOT GET AGT CAC AX GX AGO AAC NG A3T GX CAC AX GD3 rFATP 

1441F [s] K G D S A YL S G r D V L V M D E -L G 

TICSXTArG GX GEC AX GX ISC CTC TEA G3T GBCGIG CIA GE AE GOTG3G CfG GX. hFATPI 

FRK GDS AYLSG*-DVLVMDE— LG to 

TIC GGAAA3QX ffiTAX GX TBC CTC TEA GGT CSC GIG CDV GE AX GC GK3 OG GX mFATP 

F R K G D S A Y L S G D V L V M D E L G CA _ 

TTC CEA AJG OX CTC AX GX TJC CTT TA G3T G7C GIG OA GE AE GrC GG CIG GE rFATP 
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1501Y MYFR DR SG D TFRWR G 
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Y M Y F R D R S G D T F R W R G 
UC AIG T7CT TTC OGT G?C GX AX GGG CTC AX TIC OX EG OX GX 

Y M Y F R D R S G D ; T F R W R G 
TAC AIG UC TIC GGT QC GX AX GGG GKT AX TIC OGA 1G3 CX OX 

15630 T E V E [g] V L S R L L G Q T D V A V Y 
. AHT AX GrG GIG GAG GX GIG CE A3C GX CE CIGGX CAG AGA G?C GIGGX GIC TAT 
TT E VE A V, L S R L LG Q T DVAV Y 
AX AX GPG GIG OA GX GIG CE AGC GX CIA CIGGX OG AOS GPC GIG OCT GIG W 

TT EVE A V LS R L LGQ T D V A V Y 
AX AX GrG GIG C^A GX GIG CE AGC OX CE TIGGX OG AGG C?C GIG OCT GIG TAT 

1621G V AV P G V EG K A G M A a[V]a D P H 
OX GIG G3T GTT CEA GGA GIG GrG XT AEG OCA OX AIG GX GX CIC XA GOT OX OG 

GV AV P GV EG K AGMA A I AD P H 
OX GIG GCT GIG CDV G^ GIG G?G GGG ArA Q3 1 GXAIG G3V GX AX CTAGOT GX OC 

GV.AVP GV EG K S G M A A I AD P H 
G3"l GIG G2T GIG GCA GIG GG G3G PrA AX GXAIG GX GX ACT CXAGPC QX OC 
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1801 K T R L Q R E G F D P V R n T S D R L F r 

' ^^^T^^^^T^^^'^^^^™^^^ hFATPl 
K 1 K L g R E GF D P R OT S DR T P p 

K. I K L» Q R fi G ,F D P .R. Q T fi D R T XT P 

AAG AX OGft CIAGG OST GftAGXTTTGC CCC GCTOG £X TO, G?C GGGCX T3C TO fFATP 

1 ' 61 ? 5 4^4^^^^^^™ Sac SSar £ hFATP 1 

L D L,K Q G R Y V P L " D* E R V H A R T r 

L D L K Q G R Y.ILI P L ,D E R V H A R I C 
CIA GRC CJG AAA OG G3V G3C lTa QGC C1G CATGK3 pg^ qe Off GOCOCC ATC TEE r ^ATP 

192l[?] G 0 F[AlL . 
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FIG. 3 



hsFATPT cDNA 

Start Codon 

CGGGGAGACG GGACGTGAAG GGATGCGGGC 7CC7GGAGCA GGAAGAGCCT CTGTGGCCTC 60 , 
ACTGGCGCTG CTTTGGTTTC 7GGGACT7CC GTGGACCTGG AGCGCGGCGG CGGCGTTCTG 120 
TGTGTACGTG GGTGGCGGCG GCTGGCGCTT TCTGCGTATC GTCTGCAAGA CGGCGAGGCG 180 
AGACCTCTTT GGCCTCTCTG TTCTGATTCG TGTTCGGCTA , GAGCTGCGAC GACACCGGCG 240 
AGCAGGAGAC ACGATCCCGT GCATCTTCCA GGCTGTGGCC CGGCGACAAC CAGAGCGCCT 3 00' 
GGCACTGGTG GACGCCAGTA GTGGTATATG CTGGACCTTC GCACAGCTGG ACACCTACTC 360 
CAATGCTGTA GCCAACC7G7 TCCGCCAGCT GGGCTTTGCA CCAGGCGATG 7GG7GGC7G7 420 
GTTCCTGGAG GGCCGGCCGG AG77CG7GGG ACTGTGGCTG GGCCTGGCCA AGGCCGGTGT 480 
GGTGGCTGCT CTTCTCAATG TCAACCTGAG GCGGGAGCCC CTGGCCTTCT GCCTGGGCAC 540 
ATCAGCTGCC AAGGCCCTCA TTTATGGCGG GGAGATGGCA GGGGCGGTGG CGGAGGTGAG 600 
CGAGCAGCTG GGGAAGAGCC TCCTCAAGTT CTGCTCTGGA GATCTGGGGC CTGAGAGCAT 660^ 
CCTGCGTGAC ACGCAGCTCC TGGACCCCAT GCTTGCTGAG GCGCCCACCA CACCCCTGGC 720 
ACAAGCCCCA GGCAAGGGCA TGGATGATCG GCTGTTTTAC ATCTATACTT CTGGGACCAC 780 * 
CGGGCTTCGT AAGGCTGCCA TTGTGGTGCA^ CAGCAGGTAC TACCGCATTG CTGCCTTTGG 840. 
CCACCATTCC TACAGCATGC GTGCCGCCGA TGTGCTCTAT . GACTGCGTGC CACTCTACCA 900 
CTCTGCAGGG AACATCATGG GTGTGGGGCA GTGCGTCATC TACGGGTTGA CGGTGGTACT 960 
GCGCAAGAAG TTCTCCGCCA GCCGGTTCTG GGATGACTGT GTCAAGTACA ATTGCACGGT 1020 
AGTGCAGTAC ATAGGTGAAA TCTGCCGCTA CCTGCTGAGG CAGCCGGTTC GCGACGTGGA 1080 
GCAGCGACAC GGCGTGCGCC TGGCCGTGGG TAATGGGCTG CGGCCAGCCA TCTGGGAGGA 1140 
GTTCACGCAG CGCTTCGGTG TGCCACAGAT CGGCGAGTTC TACGGCGCTA CCGAGTGCAA 1200 
CTGCAGCATT GCCAACATGG ACGGCAAGGT CGGCTCCTGC GGCTTCAACA GCCGTATCCT 1260 
CACGCATGTG TACCCCATCC GTCTGGTCAA GGTCAATGAG GACACGATGG AGCCACTGCG 1320 
GGACTCCGAG GGCCTCTGCA TCCGGTGCCA GCGGGGGGAA CCCGGCCTTC TCGTGGGCCA 1380 
GATCAACCAG CAGGACCCTC TGCGGCGTTT GGATGGTTAT GTTAGTGACA GTGCCACCAA 144 0 
CAAGAAGATT GCCCACAGCG TTTTCCGAAA GGGCGATAGC . GCCTACCTGT CAGGTGACGT 1500 
GCTAGTGATG GACGAGCTGG GCTACATGTA TTTCCGTGAC CGCAGCGGGG ACACCTTCCG 1560 
CTGGCGCGGG GAGAACGTGT CCACCACGGA GGTGGAAGCC GTGCTGAGCC GCCTACTGGG 1620 
CCAGACGGAC GTGGCTGTGT ATGGGGTGGC TGTGCCAGGA GTGGAGGGGA AAGCTGGCAT 1680 
GGCAGCCATC GCAGATCCCG ACAGCCAGTT GGACCCTAAC TCAATGTACC AGGAATTACA 1740. 
GAAGGTTCTT GCATCCTATG CTCGGCCCAT CTTCCTGCGT CTTCTGCCCC AGGTGGATAC 1800 
CACAGGCACC TTCAAGATCC AGAAGACCCG GCTGCAGCGT GAAGGGTTTG ACCCCCGTCA 1860 
GACCTCAGAC AGGCTCTTCT TTCTAGACCT GAAGCAGGGA CGGTATGTAC CCCTGGATGA 1920 
GAGAGTCCAT GCCCGCATT? G7GCAGGCGA CTTCTCACTC TGAA GCTGTT CCTCTACTGG 1980 
CCACAAACTC TGGGCGTGGT GGGAGAGGCC AGCTTGAGGC AGACAGCGCT GCCCAGGGGT 2040 
GGCCGCCTAG TACACACCCA CCTGGCCGAG CTGTACCTGG CACGGGCCAT CGTGGACTGA 2100 
GAAACTGGAA CCTCAGAGGA ACGCGTGCCT CTCTGCTGCC 7TGGTGCGCC TGTGTCTGCC 2160 
TCCTC7CCCT GC7777GAGC 27N7G7C7CC 77CCA7CCN7 G7CCC7G7N7 GGGC77AACC 2220 
, ; • * 2222 

Stop Codon 
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hs FATP1 

FIG. 4A 

ATGCGGGCT: ZGGG7GCGGG CGCGGCCTC3 GTGGTCTCGC TGGCGCTGTT G7GGC7GCTG 60 ' 
G3GCTGCCGr G3ACCTGGAG CGCGGCAGCG GCGCTCGGCG TGTACGTGGG CAGCGGCGGC 120 
7GGCGC77.CC 7GCGCATCGT CTGCAAGACC GCGAGGCGAG ACCTCTTG7G AGTGTTGCGG 180 
GATCCAGGGC 7GGGGGCGGG GCTGAGGGC7 C7GGGGGCCC ACGCTGCAGG GCTGGGCTTG 240 
CGGGAGGCC7 7GGAGGTGGA GAG7GAXXXX XXXXXATACG CTGAGATCTA C7C7CTGCTG 300 
7GTAA7GC7G. CCTGGTCACT GAGAGATCAG CACAAAG7TC ACATCGCCTC ATGAAAGCCT 360 
GCTGCCTGGG TCTCAGCGGG AGGCTGAGGC 7CCAGAGGCC AGGCGGGGCA GGGCACCAGT 420 , 
GACGC7G7CC CTCCGTCCTC CCTCCCAGCG GTCTCTCTGT GCTGATCCGC GTGCGCCTGG 480 
AGCTGCGGGG CCACCAGCGT GCCGGCCACG GTCTCTCTGT GCTGATCCGC GTGCGCCTGG, 540 
AGCTGCGGCG GCACCAGCGT GCCGGCCACA CCATCCCGCG CATCTTTCAG GCGGTAGTGC--600 
AGCGACAGCC CGAGCGCCTG GCGCTGGTGG -ATGCCGGGAC CGGCGAGTGC TGGACCTTTG 660 
CGCAGC73GA CGCCTACTCC AATGCGGTAG CCAACCTCTT CCGCCAGCTG GGCTTCGCGC 720 
CGGGCGACG7 GG7GGCCATC TTCCTGGAGG GCCGGCCGGA GTTCGTGGGG C7GTGGCTGG 780 
GCCTGGCCAA GGCGGGCATG GAGGCCGCGC TGCTCAACGT GAACCTGCGG CGCGAGCCCC 840 
TGGCCT7C7G CCTGGGCACC TCGGGCGCTA AGGCCCTGAT CTTTGGAGGA GAAATGGTGG 900 
CGGGTGAGGC CAGGCGTGGG CATCAGGTGG GCGGGGACCC AGGACTGGCC CCTGGGCGGG 960 
CGGGGAG7C7 GCTGCGCCCC AGGCCTCGGA AGGCGGCCGC CTGGACGTGG GCATGAGGTG 1020 
CACGGTCTGG GTATGCCCCG GGCAGGGAGT' TGGTGCATCC ' CAGGCCTCGG GAGGGGCCTG- 1080 
TCCGGCGGTG ACCATGACCC ATGTGTTGGG GACCACAGCG GTGGCCGAAG TGAGCGGGCA 1140 
TCTGGGGAAA AGT7TGATCA AGTTCTGC7C TGGAGACTTG GGGCCCGAGG GCATCTTGCC 1200 
GGACACCCAC C7CCTGGACC CGCTGCTGAA GGAGGCCTCT ACTGCCCCCT TGGCACAGAT 1260 
CCCCAGCAAG t GGCATGGACG GTGAGTCAAC TTCCAGGACA CCTCTACCCC AATGACTCAG ^1320 
GCCCCACCCC CTAACACTGT ATCTCCTGCA GATCGTCTTT TCTACATCTA CACGTCGGGG 1380 
ACCACCGGGC TGCCCAAGGC TGCCATTGTC GTGCACAGCA GGTGAGGGGC CCACAGGCAT 1440 
AATGCCC7CA GCCGCTGAGA GTGACCCAGG CATCTTGCCA GCCTGACCTG CCCCTCAGCT 1500 
CCTGTGGGCA ' 7CTCCATGTT ACCCTGGGGA CAGAGAGGGC AGCTGGTGTT TCCTGAGCAC 1560 
TTGC7C7GTG 7CCAGACCAG GGCCAAGCC3 7GGACGTGTA AACTCATTGC AAGGGTCATA 1620 
ACTGC77GGG' GC7GGACAAA GGCATCACAC CATT77CACA CCATCTTGGG ACCXXXXXXX. 1680 
XXXXXXXXGG GGATTTAGGT CCAGCC7C73, CCTCCGGCTC CCCCTCCCCC 73CAGGTACT 1740 
ACCGCA7GGC AGCC7TCGGC CACCACGCCT ACCGCATGCA GGCGGCTGAC GTGCTCTATG 1800 
ACTGCCT3CC CC7GTACCAC TCGGCAGG7A C7ACGGCC7G GGTAGGGAAT GGTGGGTGGG 1860 
GGCGGGGGAC CCCTTACCAA GGCCACCC77 7GCAGGAAAC ATCATCGGCG TGGGGCAGTG 1920 
7CTCATC7A7 GGGCTGACAG TCGTCC7CCG CAAGAAATTC TCGGCCAGCC GCTTCTGGGA 1980 
CGAC7GCA7C AAGTACAACT GCACGCCTTG GCNNTCCGAA AGTGCTGAGA GNACAGGCGT 2040 
GAGC..TG7G CZZGGCChGG GCCCAGCCC7 777ATCGGTG CAGCTGCTGG CATCCCCTGC 2100 
TGCAAGC77C CAGCTTCCTT ATTTGTG777 3CAGCCCAAT CTTCCAGGCT GTTCT7TGTT 2160 
AGAAAATAAG 7GAT77CTTG GGCTGC7777 7G7TAGAANN GAAGTTCTAC CGAGGGTNNC 2220 
NTG777C~™ AAAAAAAAAA 7AGGNAAA77 IJGATTAGATA CTNACTNGCA 7AATANGNGA 2280 
CNGNGT73GG NNC77CCANA GAT77GCGCC CC7ATGCGGA TGGCACATCT GGTCC7CACC 2340 
GGT7777CAC GG7C7ATTTA GATCAGATAC AGCC7CCC7G NTAGCT7GTN 7G7AAAAATC 2400 
CTTGCC..77 777777TTTC TGAGGCGGAG 7C7CAC7CT7 GTCCCCXGGC 73GAGTACAA 2460 
7AGAGCGA72 77GGCi.ACT 3CAACC7C7G CC7C7CGGG7 7CAAGAGAT7 C7CC7GCC7C 2520 
AGCC7CCCAA G7AGC7GGGA 7TACAGGCAC A7GCCACCAG GTCTGGCAAA A7777G77GT 2580 
ATT777G7AG AGATGAGGTA 7TACTACA77 . 332 C7 AGGCTG GTC7CGAAC7 ■ 7C7GACC7GA 2640 
NGTGA7373C *7IACC7NGGN CCCCCCAAAG 73C733GAKN ACAGGCATGA GZZkZGGXGZ 2700 
CCAGCCA7G7 3CCCANT777 7777777777 773AGA7GGA G7C7NGG7!;7 737C3CCCCA 2760 
??EI 3 l"t" C " rC " "-^"^CG AAN773G:;C7 AA77:n;AAGC 7CCGCC7CCC 3GG7:rTACGC 2820 
w.-.7 Zl'ZZ C"7::AGCC7C "GAG7A337 33GA77ACAG GAG7GTGCCA "ACGICC73G 2880 
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FIG. > 4B 

NTAATTTTTT GNAT77777A GTAGAGACGG ' GG777NACCA TGTTAGCCAG GATGG7C7CG 2940 
ATCTCCTGAC NTNG7GA7CC ACC7GC777G NCN7CCCAAA GTGCTGGGAT . NACAGGCTTG 3000 
AGCCACCCCG CCTGGCCGTG CCCATTCTTA AGCCAG7CA7 A7TGGCNTGA GGAGGGGCTG 3060 
GGGTGATTTN CCAGGTTNTG' TCGGTAGAGA TNTTGTTGGG TGAACCANAC CC7AAGGCAG 3120 
TGGCCCAATT NTGNGAATGG GGGNGGGGAC CTTCCCCCCC TNNGACCAGG TGTTTGTTCT 3180 
GCAGCCAG7G A7C77GGG7G CATGCATTTC TTNTCTGGGC T7CAG7GC77 CA7C7CCAAG 3240 
NTCAGCTTCA GAGCATGGCG GTGGTGTCTG CTGATCATAT TGGTTTAGTC ATCNGCNNNT 3300 
TNGTCAGNAC NN7C77C7GG GGGANNAAGG GGGNNCAGGG GTGCTCTCAG TGACGTTTGA 3360 
GCTGAGACTG AAGAG77CAG GCAAANGCA7 CTGGGGGAAG ACTGCTTCAG GCAAAAGGCT 3420 
AGCATGTGCA AAGTCCC7GA GGAGGCAGAG TGTTCAGCGG 1 AGATGGAGGG AGAGGAGGCA 3480 
GCAGGGGCGG CAGGGCCAGA TCCAGTTGGC CACAGGTAGA AATGTGGGTT TCATCCTGTG 3540 
TGTGAGCGGG AAGCCCCGAG CACANTTGCA GC7GN7A7GA* 7CCCA7C7GG TGGGTAATCC 3600- 
CCTGGCANGC NANGGNTTCA TTCCCATTNT GGGGGCCGGG ATGCCTGGGT TCAAATCAAA 3660 
TTCTGCCACT TCCCAGC7CN GTGATCTCGT TCCAGTTCCT TACCATCTGT GAGCCTCGGT 3720 
TTCTCCATCT GTAAAGTGGC GGGGGGGGAG TCATTTCCGA AGCCCAACTG GAGATGAGAA 3780 
NTNTTAAAGG CACCGTGGGG CACCTAGTGG ATGACCCCAG GAAGNTCTTG CGGGGCTTGA : 3840 
, GAATGTGGCA GGGAGGGAAG ATGTCCTGCA GGGGGAGCCC ATGAGCAGGT GGCTCCGTGG 3900 
GCACCTGGGG ■ NTCAGTCCAG ACGCAGCGCA GCCTAGTGGG TGCACCTGGG AGTGGTCTGC 3 960 
CCAGGGGTGA CGCTCCGGCC AGCCCAGGCT CAGCCAGGGG GCTCTGGGTG ACAGTGTAGG 4020 
C7CCAGGC7G AGNGTTGCCT TTNGGGGTGC CAGNTCTCCT GCTAGTCCTG CCCTGTGTGC 4080 
CTGCTGAGGC TCAGACAGGG AGTGGTATCT CCAGGAAGCT GCTTGGCAGA ATCGAGGGAC 4140 
CACGAAGGGG TGCGGGTACG GGGAGGAGGA AAGACACTGC ATCTGCTTGG ATAGAAGTTT 4200 
GCATCCGGCC AGGCACAAGT GGCTCACACC TGGAATCTCA CACTTTGGGA GGCTGAGGCA 4260 
GGAGGATTGC TTGAGGCCAG GAGTTCAAGA CCAGCCTGGG GAACAGAGCG AGATCCCCGA 4320 
TCTGTACAAA AAAATCTGTA GTCCCAGCTA CTCGGGAGGC TAAGGCTGGA GGATCACTTG 4380 
AAGGCACCCC G77AGAGGC7 GCAGTGAGCT ■ GTGATCATGC AAGTGCATNC AGCTNGGGGG 4440 
GGGCAGGGGG AGACCCCCCC CCNAAAAACA ACAAAAAAAG CTGCATCNTA GACCCTTTGC 4500 
AAGAGACTGA ACGAGTCCTA GGAGTCAATG TGGTGCC7AA TGGAGTGTGG ANGA77C7GC 4560 
AGCCATCATC A7CC77AGGC TGTTCCGCTC ATAGGATTAG " CTCCCTGGG7 GGGGCGG7C7 4620 
CGGGG7N7C7 ACC7C7GA7C CGGGC7CCCC ACCGCC7GGC GG7CCCA7GA CCCAC77CC7 4680 
CACCCCG7CC CCCAGG7GG7 7CAG7ACA7C GGGGAGA7C7 GCCGC7ACC7 GC7GAAGCAG 4740 
CCGG7GCGCG AGGCGGAGAG GCGACACCGC G7GCGCC7GG CGG7GGGGAA CGGGC7GCG7 4800 
CC7GCCA7C7 GGGAGGAG77 CACGGAGCGC 77CGGCG7AC GCCAAA7CGG GGAG77C7AC 4860 
GGCGCCACCG AG7GCAAC7G CAGCATTGCC AACA7GGACG GCAAGG7GCA CACCGGCAGG 4920 
CCCCGGGGCA GG7C7GGGAG 77CAGGGAAG ACAC77G7C7 CC7C77CC7G GGGCC7GGA7 4980 
ACA7AAAACA GGC7GGAC7G GCGCGGAAGG C7CGCAAGGC GCACGCAGGG CGG7G7AGGA 5040 
GATCTGGACT CCG7GCACAC AGAAC7C7GA GXXXXXXXXX XXXXXXXGAC 7AGC7G7AGG 5 1 0 0 
77CACACAGA GGA7C7AC77 GC7GCCAGAC 7GAGG777G7 GGATCAGGAA 'GTGGCACCAG 5160 
CCAGAGG7C7 7C7C77GAC7 GAA7GCAGGC 7GGGAAGG7G GGAGGAGGGG GCC7GAG77G 5220 
GAGGGGACGG 7TAC7ACCCT GC7T77GCAG A7CAGCCAAG GCAGGCAAAG 7G77A7GAGA 5280 
AGGACCCCGG A7A7CCCCGG G777CC7AC7 CAG77CACCC CA77CCAGG7 CGGC7CC7G7 534 0 
GG777CAACA GCCGCA7CC7 GCCCCAGG7G 7ACCCCA7CC GGG7GG7GAA GG7CAA7GAG 54 00 
GACACAA7GG AGC7GC7GGG GGA7GGCCAG GGCC7C73CA TCCCC7GCCA GGCCGGGG7G 5460 
AGCAGGGCCC GGGCA7GG7C CCCACCCGGA GCAGGGG7CC CCACACCC7G ' CCTGCCTAGC 5 = 20 
GCAGGC7GAA. CA7GGCC77C 7CCC7AGGGG AGCC7GGCC7 CZ77G7GGG7 CAGA7CAACC 5580 - 
AACAGGACGC Z-ZZGZGZZGZ 77GGA7GGC7 A7G7CAGCGA GAGCGCCACC ' AGCAAGAAGA. =€40 
7CGCCCACAG Z37C77CAGC AAGGGCGACA GCGCCZkZZZ C7CAGG7GCG GAGC7GC7AG 5 ""CO 
GCCZZGGTGA ZZGGZZG7GZ 3GA7GGGGA7 C 33 CCA 3 3 ~-. 7373CCCC7G 73CC37C7G3 "zZ 

^^.rif^.T'^.^ ******* ~ « * — » » ^ /-«rr- «-» *mrr> >w r+ « «— rm—> m -^«ff»r»«r<^«» <-"• «■«■*-» -r> « r*, r+ r-* r— ~, 
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FIG. 4C 



ACACCTTCCG CTGGCGAGGG GAGAACGTCT 
GCCTGCTGGG CCAGACAGAC GTGGCCGTCT 
TGCAGGGGGT GGTCCTGAGG CATGGTCCTG 
ACACATGCCT TTGGCAGTGC ACAACCTGGA 
CTGGATCTGG AGCCAGTTCA CCTGGGTGAT 
GGATCATGAA AGCCCACCTG TATTAGGGCT 
GTGTGGCTGC TTCCATAAAT GTCATCCCAG 
TACTGCTTGA CAGTGTATCT GGTCCTGCTG 
-CCTCTGCCTC CAGGAGTGGA GGGTAAGGCA 
CTGCTGGACC CCAACGCCAT ATACCAGGAG 
CCCATCTTCC TGCGCCTCCT GCCCCAGGTG 
AACCTCTCTC TTCATCCATC AGTGTGTCTG 
CAGAAGCCAC- CTGCTCAGCC CTTATCTGCC 
CCAGAAGACG AGGCTGCAGC GAGAGGGCTT 
CTTCCTGGAC CTGAAGCAGG GCCACTACCT 
CTGCTCGGGC GCCTTCGCCC TCTGA 



CCACCACCGA GG7GGAGGGC GTGC7GAGCC 5880 
ATGGGGTGGC TGTTCCAGGC AAGCTGGGGT 594 0 
AGGGAGCTCA GCCAAAAGGG GCTTAGAGGT 6000 
CAACTGCTCA TGGCAGCCCA GGAGGAAGCA 6060 
GTTGAGCCTC AGTTTTGTCA TCAGAAAATG 6120 
TCAATGAGCC AAGCAGGAGC TCCCCAAAAT 6180 
GTTGGGAGAG ACTGGAGATT . ACAGACCTGC 6240 
GTGAGGATGA GAGGCGGGGT GTCCTCAGCT 6300 
GGGATGGCGG CCGTCGCAGA CCCCCACAGC 6360 
CTGCAGAAGG TGCTGGCACC CTATGCCCGG 6420 
GACACCACAG GTGCGAGTCT CCCCCACTCC 6480 
TTGATTCGAG GGATAT7GAG TTGAGGCCTC 6540 
CCCCATCCCC ACTATAGGCA CCTTCAAGAT 6600 
TGACCCACGC CAGACCTCAG ACCGGCTCTT 6660 
GCCCTTAAAT GAGGCAGTCT ACACTCGCAT, 6720 
6745 
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FIG. 8B 

Intron 1 

GGCGAGACCTCTT— ? — CGGTCTCTCTGT 

Intron 2 

AAATGGTGGCGGgtgaggcc— 215—gaccacagCGGTGGCCGAAG 

Intron 3 

AGGGCATGGACGgtgagtca — 131- — tcctgcagATCGTCTTTTCT 

Intron 4 • 

CGTGCACAGCAGgtgagggg-1500-cccctgcagGTACTACCGCAT 

Intron 5 

ACCACTCGGCAGgtactacg — 68-— ctctgcagGAAACATCATCG 

Intron 6 

TACAACTGCACG — — >3kb — cgtcccccacGTCGGCTCCTGT 

Intron 7 

TGGACGGCAAGgtgcacacc — 3000 — cattccagGTCGGCTCCTGT 

Intron 8 

CCCTGCCAGGCCgtgagcag~2900~ctccctagGGGAGCCTGGCC 

Intron 9 

CCTACCTCTCAGgtgcgcag — 90 — tctgccagGTGACGTGCTAG 

Intron 10 

TGGCTGTTCCAGgtcaagct— 389 — gcctccagGAGTGGAGGGTA 

Intron 11 

TGGACACCACAGgtgcgagt — 2800~-cactatagGCACCTTCAAGA 
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